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Abstract. We establish nonlinear stability and asymptotic behavior of traveling periodic waves of 
viscous conservation laws under localized perturbations or nonlocalized perturbations asymptotic 
to constant shifts in phase, showing that long-time behavior is governed by an associated second- 
order formal Whitham modulation system. A key point is to identify the way in which initial 
perturbations translate to initial data for this formal system, a task accomplished by detailed esti- 
mates on the linearized solution operator about the background wave. Notably, our approach gives 
both a common theoretical treatment and a complete classification in terms of "phase-coupling" 
or "-decoupling" of general systems of conservation or balance laws, encompassing cases that had 
previously been studied separately or not at all. At the same time, our refined description of solu- 
tions gives the new result of nonlinear asymptotic stability with respect to localized perturbations 
in the phase-decoupled case, further distinguishing behavior in the different cases. An interesting 
technical aspect of our analysis is that for systems of conservation laws the Whitham modulation 
description is of system rather than scalar form, as a consequence of which renormalization methods 
such as have been used to treat the reaction-diffusion case in general do not seem to apply. 



Keywords: periodic traveling waves; balance and conservation laws; asymptotic stability. 

2010 MSG 35B40, 35B10, 35B35, 35L65. 

Contents 

1. Introduction 2 

1.1. Slow modulation behavior 4 

1.2. Setting and preliminary observations 6 

1.3. Results and implications 9 

1.4. Examples 12 

1.5. Discussion and open problems 15 

2. Preliminaries 16 

2.1. Bloch decomposition 16 

2.2. Nonlinear perturbation equations 18 

2.3. Nonlinear damping estimate 19 

2.4. Inverse modulation bounds 20 

3. Stability 21 
3.1. Spectral analysis and nonlinear decomposition 21 



Date: November 12, 2012. 

Mathew A. Johnson, University of Kansas, Lawrence, KS 66045, matjohn@math.ku.edu 
Research of M.J. was partially supported under NSF grant no. DMS-1211183 and by the University of Kansas General 
Research Fund allocation 2302278. 

Pascal Noble, Universite Lyon I, Villeurbanne, France, noble@math.univ-lyonl.fr 
Research of P.N. was partially supported by the French ANR Project no. ANR-09-JCJC-0103-01. 

L.Miguel Rodrigues, Universite Lyon 1, Villeurbanne, France, rodrigues@math.univ-lyonl.fr 
Stay of M.R. in Bloomington was supported by French ANR project no. ANR-09-JCJC-0103-01. 

Kevin Zumbrun, Indiana University, Bloomington, IN 47405, kzumbrun@indiana.edu 
Research of K.Z. was partially supported under NSF grant no. DMS-0300487. 

1 



3.2. Basic linear estimates 25 

3.3. Linear modulation bounds 28 

3.4. Nonlinear stability 31 
4. Behavior 34 

4.1. Refined spectral expansion and decompositions 34 

4.2. Refined linear stability estimates 35 

4.3. Refined nonlinear stability estimates 36 

4.4. Quadratic approximation 37 

4.5. Comparison with the linearized Whitham equations 38 

4.6. Nonlinear connection to the Whitham equations 42 
Appendix A. Algebraic relations 44 
Appendix B. The Whitham equations and asymptotic equivalence 44 
B.l. Formal asymptotics 44 
B.2. Asymptotic equivalence of systems of conservation laws 49 
B.3. Implicit change of variables 51 
Appendix C. Asymptotic equivalence of quadratic approximants 53 
Appendix D. Generalizations 55 
D.l. Extensions in type: quasilinear and partially parabolic systems 55 
D.2. Extensions in form: an abstract continuum of models 56 
D.3. Verification of (H1)-(H3), (D1)-(D3) 57 
D.4. Applications revisited 57 
References 60 



1. Introduction 

One of the triumphs in recent years in the dynamical study of partial differential equations (PDE) 
has been the development of a rigorous theory of modulation of periodic traveling waves in optics, 
pattern-formation, and other equations, both illuminating and expanding on formal predictions 
made by WKB-type expansion much earlier on, as for example in [68, 18]. Among many other 
results, we mention in particular the resolution in [59, 60, 61] using Bloch transform/renormalization 
techniques of the then 30-year open problem of stability of periodic reaction-diffusion waves with 
respect to localized perturbations 1 and, under nonlocalized perturbations, the rigorous verification 
in [15] using related techniques of the associated second order ("diffusive") formal WKB expansion 
in various settings, in particular in the small- wavelength limit. Most recently, the WKB expansion 
has been verified for solutions of reaction-diffusion equations in the long-time limit, in [57] by 
methods related to those of [59, 60, 61, 15] and in [25, 26] by rather different techniques originating 
from the study of conservation laws [29, 30, 74, 75]. 

From these analyses emerges the clear picture of asymptotic behavior as dominated by a single 
critical mode of the linearized equations, corresponding to translational invariance of the underlying 
equations, that is governed approximately by the phase equation of the formal WKB approxima- 
tion: (the integral of) a scalar convected Burgers equation. However, there are many physically 
interesting applications to which this well-developed theory does not apply. Specifically, when there 
exist conserved quantities, whether deriving from Hamiltonian structure/symmetries of the equa- 
tions, 2 or, as in the case of parabolic conservation laws considered here, simply from divergence form 
of the equations/conservation of mass, then there exist additional critical modes, and the formal 



1 Verifying formal predictions and rigorous spectral descriptions of [17]. 

2 As for example for the Korteweg-de Vries (KdV) equation [68, 62, 31, 32] or Euler-Korteweg system [9]. 
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WKB prediction becomes that of a more complicated hyperbolic-parabolic system of conservation 
laws rather than the scalar convected Burgers equation of the reaction-diffusion case. 

Perhaps the best-known example of such a model is the Kuramoto-Sivashinsky equation, for 
which the formal asymptotic description of behavior via a hyperbolic-parabolic system of con- 
servation laws was pointed out already in [18] under the alternative form of a damped scalar 
wave equation (the "viscoelastic behavior" of the title). Further examples arise in the modeling 
of viscoelasticity with strain-gradient effects, inclined thin-film flow, and Benard-Marangoni or 
surfactant-driven Marangoni flow; see Section 1.4 and Appendix D. 

Despite the physical motivation coming from such examples, until very recently there was no 
rigorous analysis of nonlinear stability or behavior in this (system) case. Indeed, as discussed in 
Remark 2.2, the renormalization techniques of the asymptotically scalar reaction-diffusion case in 
the presence of multiple characteristic speeds (linear group velocities) appear to break down. Using 
a technically rather different set of techniques, nonlinear stability under localized perturbations has 
now been shown for such systems in [53] 3 [29, 30, 33, 3, 4] in great generality, in particular resolving 
the longstanding open problem of nonlinear stability of spectrally stable Kuramoto-Sivashinsky 
waves, dating back to the numerical confirmation in [18] of existence of bands in parameter space 
of spectrally stable waves. However, up to now, asymptotic behavior has not been determined in 
this more complicated, system, case even for localized perturbations. 

More, as discussed in [29, 30], there was some question in this case precisely what behavior one 
might expect. Specifically, one-dimensional nonlinear modulational stability under localized per- 
turbations of spectrally stable periodic traveling wave solutions of viscous conservation laws was 
shown in [29] and [30] in two different cases, depending roughly on whether or not the wave speed 
is stationary to first order along the manifold of nearby periodic solutions. These two analyses 
were motivated by a common connection observed by Serre [62] to an associated formal Whitham 
averaged system obtained by WKB approximation. However, despite this shared heuristic descrip- 
tion, the authors observed some puzzling asymmetries in the results obtained; see, for example, 
the discussion in [30, Section 1.3] on the varying linearized and nonlinear decay rates obtained in 
these different cases under localized perturbations. In particular, the nonlinear decay rate obtained 
for localized perturbations in the stationary case was slower than what might be guessed from the 
formal Whitham approximation with zero initial phase modulation; however, it was left as an open 
problem whether this intuitive initialization was correct, or whether localized initial perturbations 
could excite the phase mode through nonlinear interaction in some way. 

Here, we sharpen and extend these previous results ([29, 30]) in several ways, in particular allow- 
ing more general, nonlocalized, perturbations and rigorously identifying time-asymptotic behavior 
as agreeing to leading order with the solution of the formal Whitham system with appropriately 
prescribed initial data. Our analysis loosely follows, and also greatly extends, the approach of 
[25, 26] in the reaction-diffusion case; as noted earlier, we do not see a way to apply here the more 
familiar techniques of [59, 60, 61, 15, 57]. In the process, we explain the asymmetries observed in 
[29, 30] as connected with the different ways that initial data align with characteristic modes for 
the common Whitham system governing large-time asymptotics. 

A striking consequence of our results is that spectrally stable waves about which wave speed 
depends to first order on wave number alone are not only boundedly nonlinearly stable, but asymp- 
totically stable with respect to localized perturbations. This resolves a question brought up early 
on in [50, 51, 62] that was left open in the analysis of [29, 30]. On the other hand, with respect to 
nonlocalized perturbations, waves in the two different cases behave essentially alike. 

To put things another way, we show that the case that the part of the Whitham system corre- 
sponding to phase perturbations decouples from the rest of the Whitham equations yields decay 



3 Concerning the more tractable (since faster-decaying) three and higher dimensional case. 
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rates exactly corresponding to those of the (scalar 4 Whitham equation) reaction-diffusion case, both 
for localized and nonlocalized perturbations. Indeed, we find that reaction-diffusion and conserva- 
tion laws can be put in a common framework 

iH + f(u) x + g(u) = (B(u)u x ) x , ueR n , 

consisting of a continuum of models, with / = corresponding to the reaction-diffusion case and 
g = to the conservative case, for which a complete classification of behavior can be obtained. 

This analysis puts the conservative theory now on a par with that of the reaction-diffusion case, 
at least as far as time-asymptotic stability and asymptotic behavior. It is an interesting open 
problem to reproduce in the conservative case a small-wavelength description as obtained in [15] 
for the reaction-diffusions case. See [48, 49] for some preliminary results in this direction. 

1.1. Slow modulation behavior. We begin by emphasizing some insights gained from the WKB 
approximation process, which requires, first, a description of nearby periodic traveling waves. For 
definiteness/clarity of exposition, we restrict to the simplest case of a semilinear second-order 
parabolic system of conservation laws. However, our analysis extends with little change to the 
quasilinear 2r-parabolic or (under appropriate structural conditions as in [35, 72, 74]) the sym- 
metrizable hyperbolic-parabolic case; see [3, 4, 5, 33] for related analyses in these and more general 
situations. Examples include periodic solutions of the equations of one-dimensional viscoelasticity 
with strain-gradient effects [50, 7, 69] and of the Kuramoto-Sivashinsky equations [34, 63, 18] and 
Saint- Venant equations [16, 1, 46, 47] modeling inclined thin film flow. We discuss in Appendix D 
the changes needed to handle these interesting physical applications. 

Consider a periodic traveling-wave solution of a parabolic or "viscous" system of conservation 
laws 

ut + f(u) x = u xx , 

u,f valued in R n , x,t G M, or, equivalently, a standing-wave solution u(x,t) = U(x) of 
(1.1) u t + k(f(u) x -cu x ) = k 2 u x 



h XX 1 



where c is the speed of the original traveling wave, and the wave number k is chosen so that 

(1.2) U{x + \) = U{x). 

Integrating the traveling- wave equation kU" = f(U)' — cU' obtained by substituting u{x,t) = 
U(x) in (1.1), we obtain 

(1.3) W = f(U)-cU + q, 

where q G M. n is a constant of motion. Setting Uq := U(0), we have evidently (2n + 2) parameters 
(k, c, Uo, q) determining candidates for periodic solutions, and n constraints U(l) = Uq, suggesting, 
in the absence of additional special structure 5 that the set of nearby periodic solutions form a 
manifold of dimension n + 2. Denoting by M := U (x)dx the mean of U, we make the genericity 
assumptions: 

(HI) / G C K (R n ) for some K > 4. 

(H2) Up to translation, the set of 1-periodic solutions of (1.1) (with M, k replacing M, k) in the 
vicinity of U, M = M, k = k, forms a smooth (n + l)-dimensional manifold 

(1.4) { (U(M, k; •), c(M, k)) \ (M, k) G O } = { (U M > k (-),c(M, k)) I (M, k) G O } 



4 See Remark 1.2 below. 

5 For example, Hamiltonian structure or existence of additional conserved quantities other than q [62, 30, 9]. 
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where is some open subspace of M. n+l containing (M, k) and the role of M is defined 
implicitly by 



(1.5) M := f 1 U M ' k (x)dx. 

Jo 

Then, the formal approximate solution of ut + k(f (u) x — cu x ) — k 2 u xx = obtained by a nonlinear 
WKB expansion, as derived to varying orders of accuracy in [62, 52, 48, 49], is 

(1.6) u(x,t) « uWriWiVfat)), 

where the mean A4 and wave number k := k^ x satisfy the Whitham equations 

(1.7) M, + HF--cM h= P( Al M x + ^U 

Kt + k(-u - ck) x = k {d 2 iM x + d 22 K x ) x , 

an enlarged, (n + 1) x (n + 1) system of viscous conservation laws, where uj(M,k) = —Kc(Ai,n) 
denotes time frequency, F(M.,k) := f(U M ' K (x))dx mean flux, and dij{M.,n) are determined 
by higher-order corrections as described in [48, 49]. For convenience of the reader, we recall these 
derivations in Appendix B. The phase ^ may be recovered through the solution of 

(1.8) ^ t =u(M,k) + ck + M 21 (M,k)M x + kd 22 (M,n)K x . 

Remark 1.1. Assumption (H2), corresponding to evolutionarity of (1.7) considered as an equation 
on the manifold of periodic solutions, is necessary for spectral stability in the sense usually defined; 
specifically, as described in Lemma 1.6, it is implied by condition (D3) below. Thus, there is no 
loss of generality, and considerable gain in clarity, in assuming (H2) from the outset as we do here. 

Remark 1.2. Since our analyis is built to deal with nondegenerate cases, when we treat systems 
where bulk forces are incorporated in all the equations as in reaction-diffusion or convection- 
reaction-diffusion systems in nondivergence form, a similar count of dimensions leads us to assume 
that, up to translation, the set of 1-periodic solutions forms a smooth 1-dimensional manifold 

{ (U(k; •), c(k)) | k G n } = | (U k (-), c{k)) I k G n } 

where Q is some open interval, with no constant of integration, hence no additional parameters M. 
involved. The analog of (1.7) is then a scalar equation for the evolution of local wavenumber k. 

Three kinds of modulation are involved in (1.6): modulation in phase, wave number and mean. 
However, a prominent role is played by modulation in phase. Indeed, it is a familiar scenario that 
stability of patterns involve description of the evolution of various modulation parameters, and that 
among them the major role is devoted to parameters determining spatial positions. Note however 
that whereas for patterns whose variation is essentially localized in space, such as fronts, solitons 
or multi-solitons, kinks, shocks, etc., there are a finite number of parameters to follow, with their 
evolution described by a system of ordinary differential equations, here, in the periodic setting, 
there is a continuous description involving function-valued modulation parameters whose evolution 
obeys a partial differential system (here (1.7)), the reduction being not from continuous to discrete 
dynamics but from dynamics about periodic solutions to dynamics about constants: an averaging 
process. Note also that the special role of phase is already encoded in the formal description (1.6)- 
(1.7) since the parabolic nature of (1.7) hints at (M-M,K-k) = (M-M, kd x (^ — ld)) « ^-Id. 

This explains why, for the purposes of the stability analyses of [29, 30], it was sufficient to retain 
from (1.6)- (1.7) only the coarser approximation 

(1.9) u(x,t) ~ U (M ~ k \y(x,t)) 
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neglecting all but phase modulations. Yet in doing so one gives up any hope to describe the 
precise behavior of the phase * appearing in (1.9) since this would require a full modulation 
approximation and in particular knowledge of M.. Without this precise description of the phase, 
decay rates obtained for ^ — Id and its derivatives may indeed seem mysterious. 

In contrast, let us explain what may be guessed from (1.6)-(1.7) about behavior under localized 
perturbations, that is when initially kq — k is mean-free. To which extent this simplification will lead 
to higher order decay rates for \I> — Id or d x (^> — Id) (directly needed to analyze (1.9)) is of course 
related to whether at some order the hyperbolic part of the equation for Kt uncouples from the full 
system (1.7). In particular, if to second order oj is independent of Ai then one recovers for the 
phase higher decay rates corresponding to simpler systems for which there is no extra parameter j\A 
and (1.7) is reduced to a scalar conservation law, typical examples being reaction-diffusion systems 
treated in [25, 26, 57]; in other words decay rates for wave number perturbation are those of a 
solution of a viscous Burgers equation with mean-free initial datum. If, on the other hand, oj is 
independent of M only up to linear order, then intermediate decay rates are obtained, 6 slower 
than those for reaction-diffusion systems but faster than those for the general situation when no 
uncoupling is present or when the wave undergoes a nonlocalized perturbation. 

The latter observations, to be established rigorously in the following, were not only inaccessible 
to proof but, as discussed in [29, "Discussion and open problems"], actually undecidable from the 
point of view of the lower-order description (1.9) of [29, 30]. Indeed, translated into the present 
terminology, the question posed in [29] which of if, ^> x is the primary variable (with respect to 
true behavior) is essentially the question whether a localized initial perturbation induces nontrivial 
data for k in (1.7), the answer to which is a key step in our analysis, and a rather technical one. 

With this is mind, our strategy will be first to validate the scenario (1.6)-(1.7), then to derive 
some consequences from the analysis of (1.7). But we first need to give precise definitions of terms 
such as "spectrally stable" ((D1)-(D3) below), "localized perturbation," and "linearly uncoupled." 

1.2. Setting and preliminary observations. Linearizing (1.1) about U yields the periodic co- 
efficient equation 

(1.10) (dt-L)v = 0, Lv :={k 2 d 2 x + kcd x -kd x A)v, A(x) := df (U (x)) , 

where here L is considered as a closed operator acting on L 2 (IR;]R ra ) with densely defined domain 
H 2 (R; W 1 ). 7 Introducing the family of operator- valued symbols 

L 5 :=e-^-Le i? - =k 2 (d x + iZ) 2 + k(d x + i£)(c-A), £e [-tt.tt], 

operating on periodic functions on [0,1], determined by the defining relation 

(1.11) L(e*-f) = e*-(Ltf) for / G H 2 CT ([0, 1]), 

we define following [59, 60, 29, 30, 33, 4] the diffusive spectral stability conditions: 
(Dl) a{L) C {A | RX < 0}U{0}. 

(D2) There exists a 9 > such that for all £ G [-tt,tt] we have a(L^) C {A | KA < -6>|£| 2 }. 
(D3) A = is an eigenvalue of Lq with generalized eigenspace So of dimension n + 1. 

Remark 1.3. As the coefficients of L are 1-periodic, Floquet theory implies that the spectrum of 
L considered as an operator on L 2 (R) is purely continuous, and that A G <r(L) if and only if the 



"These intermediate decay rates require some assumptions about characteristics speeds provided by assumption 
(H3) below; see Remark 1.21. 

Henceforth, in our notation for Lebesgue and Sobolev spaces we will suppress the definition of the range; in 
particular, we will write L P (R) for the equivalence class of p-integrable R n -valued functions L P (R; E n ). 
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spectral problem Lv = Xv has an L°°(R) eigenfunction of the form v(x;\,£) = e i ^ x w(x; A, £) for 
some £ G [— 7r,7r] and u/(-;A,£) G Lp Cr ([0, 1]); that is, that 

<^(R) {L) = \J ^L2 er ([0,l]) (^) • 

£e[-7T,7T] 

See [19] for more details. In particular, since the spectrum of a given operator is purely discrete, 
consisting of isolated eigenvalues of finite multiplicity which, furthermore, depend continuously on 
£, this provides a discrete parameterization of the essential spectrum of L. 

Applying standard spectral perturbation theory [34] to the operators L^, we obtain from (D3) 
that there exists for £ sufficiently small an invariant (n + l)-dimensional subspace E^ of and 
associated total eigenprojection n(£) bifurcating analytically from Eo and its associated eigenpro- 
jection Ho, with all other eigenvalues of having real part uniformly bounded above by some 
negative constant 8 . 

Remark 1.4. Variations OmUu^ja, U' along the manifold of nearby periodic solutions lie always 
in So (see proof of Lemma 1.5), accounting for n + 1 dimensions, whereas d^U^^j ^ usually does 
not. Thus, (D3) is an assumption of minimal dimension, corresponding also to the assumption 
that there are no neutral modes of Lq other than those accounted for by modulation along the 
"slow manifold" U^ M ' k \- + /?) as in (1.6). Assumption (D2) may be recognized as an "asymptotic 
parabolicity" assumption encoding time-asymptotic diffusion comparable to that of a second-order 
heat equation; it is directly related to the parabolicity of system (1.7). Assumption (Dl) encodes 
that the spectrum corresponding to marginal stability is minimal, thus confined to {0}. 

The following observation hints what may be gained at the linear level from "linear uncoupling" 
8mc\(M, k) / 0, at the same time relating Eo explicitly to variations along the manifold of periodic 
traveling waves nearby U. Note [62], that this condition corresponds with decoupling of the k 
equation in the first-order part of the linearization about (M,k) of the Whitham system (1.7). 

Lemma 1.5. Assuming (H1)-(H2) and (D3), Lq has a nontrivial Jordan block at A = if and 
only if 8mc ^ at (M,k), or equivalently 8moj{M ,k) ^ in (1.7), in which case there is a single 
Jordan chain of height two ascending from the genuine right eigenfunction U' . In either case, 

E = Spa,n{d M U,U'}. 

Proof. Variations &m U comprise an n-dimensional subspace of solutions of 

L d M U = -k{d M c\ { M,k)W' , 

i.e., either the eigenvalue or generalized eigenvalue equation at A = 0, complementary to genuine 
eigenfunction U' (mean zero, hence independent of 8mU by duU(x)dx = Id, a consequence 
of (1.5)). Comparing dimensions, we thus have Eo = Spanj^M^, U'}. If 9mc\^j^ = 0, then 
Span{c?Af U}, hence also Eo, consists entirely of genuine eigenfunctions. If, on the other hand, 
there is a direction v in which 8mc\^m k) ' v ^ ^' then there is a single generalized eigendirection 
Span{<9M^ • ^} over U' . □ 

The following lemma justifies the apparently special assumption (H2). 

Lemma 1.6. Assuming (HI), (D3) implies (H2). 



Mark that the important property we have used through these arguments and obtained from the introduction of 
Bloch symbols is compactness, which plays for this periodic setting the role of the finite dimensionality that one 
obtains with Fourier symbols associated to constant-coefficient operators. 
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Proof. Observe that variations with respect to (Uo,q) in (1.3) satisfy the eigenvalue ODE for Lq 
while variations in c satisfy the generalized eigenvalue equation associated with genuine eigenfunc- 
tion U' , hence the subspace of all elements of their linear span satisfying the constraint of periodicity 
is contained in £q- Condition (D3) implies that this subspace is dimension < n + 1, from which 
we may deduce that the n-dimensional periodicity condition U (1) = Uq is full rank at the values 
(k, c, Uo,q) corresponding to IJ, as the kernel with respect to the 2n + 1 parameters (c,Uo,q) is 
dimension < n+ 1. This guarantees existence of a smooth parametrization U(a,(3; •) = U a (- + /3), 
with (a, f3) lying in some open set of R n x R, of the manifold of nearby periodic solutions, where- 
upon we may conclude using (D3) again that det d a (M, k)u^ p\ ^ by [52, Theorem 1.3]. 9 Thus 
(a, (3) I—?- (M,k,(3) is locally invertible, yielding a smooth parametrization by (M,k,(3). □ 

We complete our set of assumptions with a final nondegeneracy condition corresponding to strict 
hyperbolicity at (M, k) of the first-order part of (1.7), namely, the assumption: 

(H3) the eigenvalues aj of d ^ F ~Q^M~^~ ck ^ \ (m k) are distinct. 

The role of this assumption is made clearer by the following connections established at the linear 
spectral level in [62, 52] between the Whitham system (1.7) and long-time (~ low-frequency for 
parameters, ~ low-Floquet exponent £ for original functions) behavior. 

Proposition 1.7 ([50, 62]). Assuming (H1)-(H2), the [n + 1) -multiplicity eigenvalue A = of 
at £ = bifurcates in a differentiable way for £ ^ sufficiently small into n + 1 eigenvalues 

(1.12) X j (0 = -ika j + o(0, j = l,...,n + l, 

where a,j are the eigenvalues of d(M,k)(F ~ cM,—co — ck)\^^, that is, kaj are the characteristic 
velocities of the first-order part of the Whitham modulation equations (1.7) at the values (M,k) 
associated with U. Moreover, assuming (H1)-(H3), this bifurcation is analytic. 

Proposition 1.7 was established in [62, 52] using direct Evans function calculations. In Section 3.1, 
we provide an alternative proof based on direct spectral perturbation expansion (as in [48, 49]) 
which is better suited to the techniques utilized in our analysis, and yields also information about 
eigenprojections. In the meantime, we observe the following interesting corollary. 

Corollary 1.8. Assuming (H1)-(H2), a(L) C {A|KA < 0} implies that characteristics aj are real. 
That is, weak hyperbolicity (aj real) 10 of the first-order Whitham equations at (M,k) is necessary 
for spectral stability; in particular both (Dl) and (D2) imply this notion of weak hyperbolicity. 

Corollary 1.8 gives rigorous validation of the Whitham equations as formal predictors of stability. 
Indeed, their hyperbolicity is often used as a definition of "modulational stability." 

Remark 1.9. Assumption (H3) provides two kinds of regularity in a simple unified way: on one 
hand it gives the analyticity of critical spectral modes of L, and on the other hand, when combined 
with weak hyperbolicity (here following from (Dl) or (D2)), it yields strict hyperbolicity of the 
Whitham's system. We expect that, by usual considerations, it could be replaced with symmetriz- 
ability of the Whitham's system and a direct smoothness assumption on spectral expansions. 

We still need to say some words about what we mean by a nonlocalized perturbation. First, 
a localized perturbation of U is something that may be written as U + v with v localized (and 
smooth), say v 6 L X (R) n H K (M). Note that in order for the process of gluing together a left 
portion of the original wave, some function on a finite interval, and a right portion of the original 

periodic Evans function computation showing that the (n + l)st derivative of the Evans function at A = is 
proportional to det d a (M, k) , hence det d a (M, k) | ^,0) 7^ is necessary for (D3) (implicit also in the earlier work [62] ) . 
10 Full hyperbolicity requiring of course also semisimplicity of a,- as eigenvalues of d( ~ F ~g^}~^~ ck ^ \ (M,k) ■ 
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wave to yield a localized perturbation (according to our definition), the left-hand and right-hand 
copies of the original wave should be in phase. It is this stringent condition (corresponding to 
a mean-free condition for the local wave number) that we want to relax in going to nonlocalized 
perturbations. 

Thus, rather than localized perturbations, we consider perturbations of the type (U + v)oty with 
v and d x (^ — Id) localized, allowing for changes in phases between limiting left and right waves 
(^ — Id is not localized) but not a change in the waves themselves, for instance in its wave number 
or its mean. In other words, our nonlocalized perturbations will still yield localized data for the 
Whitham system (1.7). 

1.3. Results and implications. With these preparations, we are ready to state our two main 
theorems. The first one is an extension of [29, 30] to stability under nonlocalized perturbations. 
The second one provides asymptotic behavior by validating the scenario (1.6)-(1.7). 

Here, and throughout the paper, given two real valued functions A and B, we say that A < B 
or that for every x 6 dom.{A) n dom(-B), A(x) < B{x) if there exists a constant C > such that 
A{x) < CB{x) for each x £ dom( J 4) n dom(S). Even in a chain of inequalities, we will also feel 
free to denote by C harmless constants with different values. 

Theorem 1.10 (Stability). Let K > 3. Assuming (Hl)-(HS) and (Dl)-(DS), let 

Eq := ||£to(- - MO) ~ ^(•)llL 1 (R)nff i:f (R) + l|3a;fro||L 1 (R)ntf*' (K) 

be sufficiently small, for some choice of phase shift ho. Then, there exists a global solution u(x,t) 
of (1.1) with initial data uq and a phase function tp(x,t) such that ip(-,0) = ho and, introducing a 
global phase shift V>oo = (ho(-oo) + ho(oo))/2, for t > and 2 < p < oo, 



(1.13) 

and 
(1.14) 



\u(- - ^(.,t),t) - u( ■ )|| LP(K) < e (i + ty^-yp) 

IIV^VWIIlwm) <^o(l + i)-5 (1 - 1/j,) , 



\\u(t)-U(- -^oo)|U'»(R) > ll"0(*) - ^ooHl-(R) < Eo, 

\\u(t) - ^IU°°(R) ^ E + |V>oo mod 1|; 
in particular 

(1-15) \H;t) - U(- + H;t))\\ LPm Z ^o(l + t)-^M. 

Remark 1.11. The above result suggests the introduction of "space- modulated distances" 11 

5x(u,v) = inf \\u o -fy - v\\ x + ||5 X (* - Id)||x- 

In these terms, it states 5 L i nH K — (5 L 2 nZ/00 asymptotic stability, 12 and 5 L i nH K — \\ ■ bounded 
(orbital) stability. In the following, among other things, we discuss situations, involving appropriate 
uncoupling conditions, under which one may go from this "space-modulated" asymptotic stability to 
the usual || • ||ii n ij^ — II ■ ||L 2 nL°° asymptotic stability. Note that this notion of "space-modulated" 
stability is a natural generalization of the more common one of orbital stability for patterns with 
localized variations (e.g. fronts, shocks, kinks, solitons, etc.), where the above infimum is taken 
over uniform translations only. 



These are not true distances, but rather measures associated with a seminorm. 
: The proof gives also a S L i nHK — 8 h k asymptotic stability. 
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Theorem 1.12 (Asymptotic behavior). Let n > 0, arbitrary, and K > 4. Under the assumptions 
of Theorem 1.10, and suitable parametrization there exist M(x,t), and ip(x,t) such that tp(-,0) = ho 
and, with global phase shift ?/>oo = (ho(— oo) + /io(oo))/2, for t > 0, 2 < p < oo, 

_ ^(., t) )t ) _ l7 Af+Af(,t) ) fc/(i-^(-,t))(.)|| iP(R) < E ln(2 + t) (1 + i)-f , 
( L16 ) ||(^,M)(t)|| LP(M) < i?o(l + t)^ (1 - 1/p) , 

" "0oo||l°°(R) < -^0- 



Moreover, setting = (Id--0(-, i)) -1 , « = fcc^, .M(-,t) = (M+M(-,i))o*(-,£) ; and defining 

(Ai\y, K w) and to be solutions of equations (1.7), and (1.8) with initial data 

M w (;0) = M + u -Uo^(.,0) + ( 1 -l) (Uo*(.,0)-M), 

1 j kw(;0) = kd x *(-,0), 

*w(;0) = *(-,0), 

we Ziawe, /or i > 0, 2 < p < oo, 

\\(M,K)(t) - (M w ,K W )(t)\\ LP(R) < Eo(l + t)-^- l ^-\+\ 

(1.18) 

\m) - *w(*)iii»(R) < ^o(i + t)-2( i -vp)+^ ; 

in particular, k = kd x ^f, k\v = kd x ^\y> and 



(ll9) M-,t)-u M ^>^m-,t))h P{ m < EoH2 + t)(i + t)--,, 

■ ||n(-,t)-C/- M ^(-' t W(-'*)(vI/ H ,(.,t))|| LP(R) < £ (l + tr^ (1_1/p)+?? - 

Remark 1.13. A suitable choice in parametrization is made here to ensure that interdependences 
on k and (3 in U^ M ' k ^ (• + /3) are compatible with the expected relation between local phase and local 
wave number, k = kd x ^> . Explicitly, our normalizing choice is performed in (4.2) (which involves 
<2V t +i(0) = u ad i defined in Proposition 3.1) to get Lemma 4.1. 

Remark 1.14. Prescription of the initial data (1.17), especially for Ai\y(-, 0), is a subtle point 13 not 
evident from the viewpoint of formal approximation (1.6)-(1.7). In particular, the appearance of a 
term related to phase variations in (1.17) (i) arises in our analysis through a detailed study of the con- 
tribution of high frequencies of the local wave number to variations of the low Floquet number part 
of the solution (see the key equality (3.27)). Nevertheless, in the end, each term involved in (1.17) (i) 
has a nice interpretation, with uq — U o *(-,0) accounting for the contribution of amplitude vari- 
ations to the initial perturbation of the mean M and (l/d x ^(-,0) — 1) (U o \l/(., 0) — M) encoding 
the contribution of period variations. To be more specific, on one hand, setting do = uq — Uo ^(-, 0), 

~ 1/2 ~ 

we observe that do differs from x i — > f-1/2 do{x + y) dy, which is easier to interpret, by a localized 
zero-mean function, a difference that is asymptotically irrelevant 14 at our level of description. Like- 
wise, assuming on the other hand validity of the approximation U o \I>(., 0) ~ U + ( , J / (-, 0) — Id^t/ 7 

1 /2 — 

leads us to consider x 1 — > J_y 2 (^(x + y, 0) — (x + y))U'(x + y) dy which is 

xi— >- [ 2 i {d x V{x + y,0)-l)U(x + y)dy + U (x - 1/2) + 1/2, 0) - *(x - 1/2, 0)] 

where, up to localized zero-mean functions, the first part of the sum is — (d x ^/(-,0) — l)U and 
the second reduces to (d x ^(-,0) — 1)M. Note that the fact that in the end of this latter formal 
computation we recover the formula of the Theorem only in an approximate way reveals that the 



13 This issue does not arise in the related analysis [26] of the reaction-diffusion case, as M does not appear. 
14 This follows from the general theory for parabolic systems of conservation laws, see for instance Proposition B.l. 
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first approximation in the argument is invalid. Yet this incorrect approximation possesses a correct 
analog (see Section 2.2) leading to the initial data of the Theorem. Note also that any small 
localized perturbation of (M, k) may be realized as initial data in (1.17) by appropriately choosing 
ho and uq so that at our level of accuracy the full dynamics of (1.7) near (M, k) are present in (1.1) 
around U. 

Remark 1.15. Bounds (1.19) are both of form (1.6), with k^f x = k, validating a slow modulation 
picture of behavior. Yet comparison of (1.19) (i) with (1.19) (ii) reveals that by allowing (M.,k) to 
satisfy (1.7) only in an approximate way we here construct a phase modulation more accurate at 
least by factor (1 + t)~ 1 / A ~ r i than that of the formal Whitham construction (1.6)-(1.7). 15 This is 
again a manifestation of the fact that comparisons to periodic functions are very sensitive even to 
small perturbations in description of respective spatial positions as encoded by local phases. 

Decay for localized perturbations. Standard bounds on localized solutions of systems of parabolic 
conservation laws of form (1.7) (see Proposition B.l below) show that in general 

IK*) - Hlv { m) ~ (1 + t)"^ (1 - 1/p) , (t) - Id|| L oo (K) ~ 1, 

so that 

(1.20) ||^(-.*).<.*)(*(.,t)) - C?(-)||l~ w ~ ||*(t) -Id|| LOO(ffi) ~ 1 

S> (1 + t)^2 +r i. Together with (1.19), this rigorously validates the formal Whitham approximation 
while simultaneously showing that estimates (1.13)-(1.14) are sharp for nonlocalized perturbations, 
ho 0, leading always to nontrivial localized data (1.17) in kw for the Whitham system (1.7), and 
for localized perturbations ho = are sharp in the generic case where no uncoupling is present. 

However, an interesting further implication of (1.19) is that when Kyy decouples to sufficient 
order from the rest of the Whitham equations, the estimates (1.13)— (1.14) can be sharpened for 
localized perturbations, to yield asymptotic decay. To make this latter point precise, we introduce 
the following definitions. 

Definition 1.16. We say that a wave is linearly phase-decoupled if dMc\(M,k) = or > equivalently, 
8mw(M, k) = in (1.7): that is, k is a characteristic variable for (1.7) at the special point (M,k). 
Otherwise, we will say that it is linearly phase-coupled, or simply "generic type." 

Definition 1.17. We say that a wave is quadratically phase- decoupled if both dMc\^,k) = an d 
^M c \(M,k) = 0- or > equivalently, <9m^(M, k) = and d]^oj(M, k) = in (1.7). 

This simple classification unifies and generalizes a number of observations in [50, 51, 29, 30]. 
As we have seen in Lemma 1.5 and shall discuss further in Remark 3.2, linear phase-decoupling 
implies that to linear order the phase behaves similarly as in the reaction-diffusion case studied 
in [59, 60, 28] (localized perturbations) and [25, 26, 57] (nonlocalized), for which the associated 
Whitham system consists of a single equation 16 

K t — fc(w(/c) + ck) x = k 2 (d(K)K x ) x , oj(k) = —kc(k), 

encoding the nonlinear dispersion relation induced by the periodic existence theory. In particular, 
we shall show that spectrally stable linearly phase-decoupled waves like spectrally stable reaction- 
diffusion waves are linearly and nonlinearly asymptotically stable and not only boundedly stable with 
respect to localized perturbations. Yet at the nonlinear level the phase behaves similarly as in the 
reaction-diffusion case, sharing the same decay rates, only if quadratic decoupling is present. The 

15 Here we are using the additional fact (not explicitly stated here) that estimate (1.19) (ii) is sharp. On the other 
hand, we do not expect (1.19) (i) to be sharp (see Remark 2.1). 
16 See Remark 1.2. 
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situation is actually simpler in higher dimensions where the asymptotic dynamics are essentially 
linear and the distinction between linearly decoupled and generic cases is sufficient [53, 29]. We 
make these observations precise in the following corollary. For a proof, see Appendix B.2.2. 

Corollary 1.18 (Localized perturbations). Under the assumptions of Theorem 1.12, for localized 
perturbations ho = 0, and -ip defined as in Theorem 1.12, if U is linearly phase- decoupled and 
E± := Eo + HI ■ | (uq — C/)||li(k) is sufficiently small, then, for t > and 2 < p < oo, 

\\v x ,tm\\ LPm <m+ t)-^M-\^, 
\\u(t)-u\\ LP(R) , \m)\\ LP{R) <E 1 (i+t)^ +r ', 

while if U is quadratically phase- decoupled and Eq is sufficiently small, then 
(1 \\V*Mt)\\r*<R) £ E,(l + t)-^' l M-^\ 

\\u(t) - u\\ LPm , wmwipm $ E (i+t)-^M+v . 

for t > and 2 < p < oo. In either case, U is nonlinear ly asymptotically stable from 
L L (R; (1 + \x\)dx) n H K (R) to D>(R), for all 2 < p < oo. 

Remark 1.19. Comparing bounds (1.22) for localized perturbations and (1.13)— (1.14) for nonlo- 
calized perturbations to those obtained in [57, 25, 26], we see that bounds for the quadratically 
phase-decoupled case exactly match the bounds for reaction-diffusion systems. 

Remark 1.20. One may wish to express localization as a mean- free condition on d x h$. Actually, 
in the above bounds, the condition ho = may indeed be relaxed to the condition that d x ho is 
mean- free and either E\ := Eq + \\\ ■ \ d x ho\\ ^ (r) is small in the quadratically phase-decoupled 
case or E\ := Eq\\\ ■ \ d x ho\\ L i^ + ||| • | (^o( - — ^o( - )) — ^)IIl 1 (r) i s small in the linearly phase- 
decoupled case. In either case, the conclusion is (asymptotic) orbital stability with asymptotic 
phase = (ho(— oo) + /io(oo))/2 (in the sense of [21]). 

Remark 1.21. For the analysis of localized perturbations in the linearly phase-decoupled case, 
assumption (H3) plays a role deeper than just providing regularity in a simple way. Indeed, in this 
case, the extra damping (1 + 1) -1 / 4 in (1.21) encodes the fact that quadratic interactions between 
diffusion waves traveling at different characteristic speeds are asymptotically irrelevant ([40], see 
Remark B.4). Thus, here one should not expect to be able to replace (H3) with something weaker 
than: the linear group velocity associated to the wavenumber mode is different from all other 
characteristic speeds. 

1.4. Examples. Having established the importance for asymptotic behavior of the Whitham equa- 
tions, we now give some examples indicating their range of possible behaviors. In this section, we 
relax the restriction, made for expositional simplicity, to second-order parabolic semilinear systems 
of conservation laws and discuss a full range of models arising in applications, including reaction- 
diffusion equations (Example 1.22), equations with higher-order or partial diffusion (Examples 1.22, 
1.25 and 1.26), and even mixed conservative/nonconservative equations (Example 1.26). 

From the point of view of the present paper, the main example is Example 1.23, which illustrates 
for second-order parabolic semilinear conservation laws both phase-decoupling and phase-coupling. 
However, we emphasize that the analysis of all of these models may be carried out with minor 
changes within the same basic analytical framework set out here and in [29, 30]. We discuss this 
further in Appendix D, along with the question of numerical or analytical verification of the stability 
conditions, needed to conclude validity of the Whitham equations. 

To simplify the discussion, we restrict to the first-order part of the Whitham equations, which 
suffices to determine the main qualitative features of solutions- in particular, phase-decoupling vs. 
coupling- and has a common derivation/form [62] independent of second- and higher-order terms. 
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Example 1.22 ([60]). The Swift-Hohenberg equation 

(1.23) u t + (l + dl) 2 u-ru + f(u) = 0, 

where r 6 R is a bifurcation parameter and / is some sufficiently smooth nonlinearity, admits for 
certain values of r periodic waves of speed c = 0. This equation arises as a simplified equation for 
the Taylor-Couette problem and is proved to possess diffusively spectrally stable waves [60]. As a 
reaction-diffusion equation, with no conservative part, this yields (see, e.g., [24, 15, 57, 29, 30]) a 
scalar first-order Whitham equation 17 

Kt = 0. 

Example 1.23 ([50, 62, 7]). The equations of one-dimensional viscoelasticity with artificial vis- 
cosity and strain-gradient effects ("capillarity") may be expressed in Lagrangian coordinates, after 
a change of variables [58, 51], as 

T~t U x = £\T XX , 

u t - cf(t) x = e 2 u xx , 

where r = £ R rf and u = xt £ are derivatives of deformation x '■ x — > W 1 , a is the stress-strain 
relation of the elastic material, and £1,62 > are scalar coefficients related to viscosity /capillarity; 
see [7]. It is readily verified by energy considerations [62, 50, 55] that periodic waves may only have 
speed c = 0. By this, together with Galilean invariance with respect to shifts in u, we find, setting 
T, U, X to be the means of f , u, and <r(f ) over one period, that the associated first-order Whitham 
system has form 

T t -U x = 0, 

(1.25) I7 t -E(T,K)* = 0, 

K t = 0, 

hence, since co(T,U,k) = is evidently independent of (T,U), is quadratically (indeed, totally) 
phase-decoupled. That is, in terms of the phase equation principally determining behavior, (1.24) 
and (1.23) exhibit parallel behavior Kt = 0, 18 despite their different origins. For localized data ho = 
0, giving kq = k, this reduces to the first-order wave equation Tt — U x = 0, Ut — S(T, k) x = 0, which 
is strictly hyperbolic when E is monotone decreasing, as occurs for some but not all cases [69]. By 
Corollary 1.18, any spectrally stable periodic solutions of (1.24) would be nonlinearly asymptotically 
stable with respect to localized perturbations, answering a question posed in [51, 62, 29]. 

The same equations written in Eulerian coordinates (in terms of p := r _1 and m := u/t) 
are phase- coupled [62], hence at best only nonlinearly bounded stable, with the explanation that 
modulations in wave speed in this case lead to deviation of characteristic paths, hence solutions are 
no longer compared along the Lagrangian trajectories where they are most closely matched. To 
put things another way, coordinatization by Lagrangian markers accomplishes a substantial part 
of the modulation that in Eulerian coordinates yields decay estimate (1.13) (i). 

Remark 1.24. It has been shown in [50, 55] that under a wide variety of circumstances, in particular, 
always for one-dimensional deformations, d = 1, periodic solutions of (1.24) are spectrally unstable, 
whether or not the first-order Whitham equations (1-25) are of hyperbolic type. This shows the 
importance of the full diffusive stability conditions, beyond the intuitive conditions of Corollary 1.8. 
It is an interesting open problem whether there exist stable waves for d > 1 [69, 55]. 



1 'See Remark 1.2. 

18 At second order, a linear heat equation n t = d,K xx , d > 0. 
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Example 1.25. For the Kuramoto-Sivashinsky equation 
(1.26) ut + {u 2 /2) x + u xx + u xxxx = 0, 

setting U, E to be the means of u, and u 2 /2 over one period, and making use of the Galilean 
invariance x — > x — ct, u — > u + c, we find that c(U\ + U2, k) = c{U\, k) + U2 and T,(U\ + U2, k) = 
S(J7i, fc) + U1U2 + U2/2. It is known that, within a certain parameter range, (1.26) supports odd- 
hence mean-free- profiles with c = (see Remark 1.27). For these solutions, c(0, k) = 0, hence 
c(U, k) = U. Thus, the associated first-order Whitham system is linearly phase- coupled, of form 

(l27) ^ + (£([/*, ac) + (*7-t4) 2 /2), = 0, 

K t + ((U- U*)k) x = . 

Linearizing about constant solution (£/*, fe) and reintroducing the phase through k = tp x , this gives a 
second-order wave equation tptt+k(dk'^)(U i ,, fc) = in the phase [18] provided (dfc£)(f7*, < 0. 
As illustrated numerically in [5], linear phase-coupling has the effect that nonlocalized perturbations 
in the phase can arise even through localized initial perturbations. Numerical studies [18, 5] indicate 
that there exist "bands" in parameter space of spectrally stable waves, satisfying hypotheses (Hl)- 
(H3), (D1)-(D3); see Appendix D for further discussion. 

Example 1.26. The Saint-Venant equations for inclined thin-film flow appear in Lagrangian 
coordinates as 

n- u x = 0, 

(T 28) 

u t + (i2Fr l T~ 2 ) x = 1 - ru 2 + u(t- 2 u x ) x , 

where r is the reciprocal of fluid height, u is velocity averaged with respect to depth, x denotes 
a Lagrangian marker moving with the flow, and v and F are dimensionless constants, with force 
term 1 — ru 2 representing the balance between gravity and turbulent bottom friction. In terms of 
structure, this is intermediate between the reaction-diffusion case of (1.22) and (1.24), having a first 
equation in conservative (divergence) form and a second equation in nonconservative convection- 
reaction-diffusion form. The same derivation as for (1.7) yields the first-order Whitham system 

T t - U(T, k) x = 0, 

V 1 at- (c(T, k)k) x = 0, 

where T and U are defined as the means of f and u over one period, and (T, k) parametrize the 
associated two-parameter family of periodic traveling waves with speed c = c(T,k); see [33, 3, 49] 
for further details. For this model, the speed c is never zero, and in particular depends typically 
nontrivially on T. Thus, this system, like (1.27), is in general fully phase-coupled. Numerical 
experiments [3, 6] indicate that (1.29) can be either hyperbolic (consistent with stability) or elliptic 
(implying instability), depending on parameter values; moreover, there exists a band of parameters 
on which waves satisfy the stability hypotheses (H1)-(H3), (D1)-(D3). 

Remark 1.27 (Phase-decoupling and symmetry). As illustrated by Example 1.23, phase-decoupling 
is not always an isolated degeneracy on a special set of parameters, but for models with special 
structure may hold on an open set of parameters/waves; indeed, more, we may have c = 0. This is 
reminiscent of the well-known principle in the reaction-diffusion setting that, by reflection symmetry 
of the equations ut + f(u) = u xx , even-symmetric standing- wave solutions generically persist as 
families of solutions with c(k) = 0. For, otherwise, the fact that reflection preserves k would violate 
local uniqueness of solutions as a function of k. Alternatively, one may observe that zero-speed 
waves satisfy a Hamiltonian ODE, hence, by a dimensional count, exhaust the available dimensions 
in the set of nearby solutions. This principle is illustrated in the behavior cited in Example 1.22. 
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Likewise, in Example 1.23, one finds [33, 3] that zero-speed waves satisfy a Hamiltonian ODE 
identical to that of the reaction-diffusion case, with d free parameters given by a constant of 
integration, hence, by a dimensional count, generically fill up the (d + l)-dimensional set of nearby 
solutions, giving c(T,k) = 0. Similarly, the Kuramoto-Sivashinsky equations (1.26), are invariant 
under x — > —x, c — > — c, u — > —u, from which we may deduce that odd-symmetric zero-speed 
solutions generically persist, as cited in Example 1.25. For, otherwise, the fact that reflection 
preserves k would violate uniqueness with respect to k of solutions with fixed zero mean. 

1.5. Discussion and open problems. Our results extend to the conservative case the results 
established recently for reaction-diffusion systems in [57, 25, 26] regarding behavior, and extend to 
nonlocalized perturbations the results obtained for conservation laws in [28, 29, 33, 3, 4] regarding 
stability under localized perturbation. The method of analysis used here is similar to but much 
more complicated than the arguments used in [25, 26] to study the reaction-diffusion case and the 
reader is encouraged to consult these references as motivation in a simpler context. As noted above, 
the methods of [57] do not seem to apply. 

The main new difficulties overcome in the present analysis beyond that of [25, 26] are the treat- 
ment of nonlocalized perturbations in a way including the phase-coupled case, which has an essen- 
tially different Jordan block structure from that of the phase-decoupled case, and the identification 
of the Whitham equations with the asymptotic second-order modulation system arising naturally in 
our analysis via a system of integral equations. The latter task involves surprisingly subtle aspects 
not present in the reaction-diffusion case concerning the influence of phase modulation \I> on the 
mean Ai, first, through high-frequency resonances, on its initial data (see Remark 1.14), and, sec- 
ond, through the influence of the implicit nonlinear change of independent coordinates (2.5) used in 
our nonlinear iteration scheme on the form of the Whitham equations (see Section B.3). 19 Though 
we give a unified proof, regardless of coupling distinctions, mainly out of a desire for clarification 
of the essential features of modulation theory, a proof of Theorem 1.12 would not be much simpler 
had we restricted it to the linearly-uncoupled case. 

Comparing bounds (1.16)-(1.19) to the corresponding bounds for reaction-diffusion systems in 
[26, Theorem 1.3], we see that they are identical; that is, modulations are equally well-approximated 
for systems of conservation laws as for reaction-diffusion systems by the formal Whitham approx- 
imation (1.7). It follows (through Proposition B.l) that for nonlocalized perturbations, behavior 
and decay rates are also essentially identical in these two cases, as the formal asymptotics suggest. 

On the other hand, for localized data, ho = 0, the decay estimates established for the reaction- 
diffusion case in [53, 28] are faster by a factor (1 + t)~i, or "roughly one derivative" in terms of 
standard heat bounds, than those of Theorem 1.10 in the generic conservative case, which are the 
same for localized as for nonlocalized perturbations. As discussed in [30], linearly phase-decoupled 
waves exhibit a similar behavior at the linearized level. However Corollary 1.18 shows that a 
quadratic decoupling is needed to yield a similar behavior at the nonlinear level, while waves that 
are linearly phase-decoupled show an intermediate behavior, asymptotic stability but with slower 
rates. 

At broadest level, our results confirm that an accurate distinction is not between reaction- 
diffusion and conservation law systems but between phase- decoupled, and non-phase-decoupled waves 
(the former trivially including the reaction-diffusion case), which indeed exhibit the asymptotic 
behavior suggested by their common formal asymptotic description in terms of the Whitham equa- 
tion^). A key new piece of information supplied by our analysis that is not present in the formal 
Whitham derivation is the way in which initial data is taken on by the time-asymptotic Whitham 
system. Though the ultimate prescription in Theorem 1.12 is simple, it is determined by a detailed 



Recall that M does not appear in the Whitham equation for the reaction-diffusion case. 
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series of linear and nonlinear estimates that are quite far from the techniques of formal asymptotic 
expansion. 

We stress, finally, that our nonlinear iteration scheme is quite robust. In particular, there is no 
use of analytic semi-group properties in our argument, hence it is not sensitive to changes in order 
(e.g., to KS or KS-KdV) or type (e.g., quasilinear or degerate as for Saint- Venant) of the equations 
under study. To control regularity, we mainly use nonlinear H s damping estimates and C° semi- 
group resolvent bounds in H s ; both given by standard energy estimates techniques (Kawashima's 
if needed [35, 72, 74]). This allows a wide range of generalizations, as discussed in Appendix D. 

The diffusive spectral stability conditions (D1)-(D3) have been shown numerically to hold for 
"bands" of stable periodic waves, in several interesting settings, and with a high degree of precision; 
see, for example, [18, 4, 5]. We view the numerical proof of these conditions, or analytical proof in 
interesting asymptotic limits (in the spirit of [27]), as important open problems for the theory. The 
determination of asymptotic behavior in the small-wavelength limit, analogously as in [15] for the 
reaction-diffusion case, is another important open problem. Likewise, extensions to the case that 
not only the phase but the wave number k = *$> x has different values at plus and minus infinity, 
corresponding to Riemann data for the Whitham equation (1.7), is an interesting direction for 
future investigation; see [15, 8] in the reaction-diffusion case. 

Plan of the paper: The plan of the rest of the paper is as follows. In Section 2 we set up the 
framework of the proofs (introduction of the phase, integral transform, etc.), then in Section 3 we 
prove Theorem 1.10. Finally, in Section 4, we give the proof of Theorem 1.12. In Appendices A-D 
we provide, respectively, algebraic relations obtained by differentiation of the traveling-wave ODE, 
derivation of the Whitham system (1.7), simplifications of this system afforded by the theory of 
parabolic conservation laws for data consisting of localized perturbations of a constant state, and 
some hints regarding generalizations to other situations of mathematical or physical interest. 



2. Preliminaries 

In this section, we discuss several technical preliminaries that we will find useful throughout our 
analysis. Specifically: 

• we introduce the Bloch transform, the fundamental integral transform that we will use in 
deriving all of our linear estimates; 

• we show how phase shift ip is introduced in (1.1) and how this affects the equations; 

• we prove a nonlinear damping energy estimate (here simply following from the parabolic 
nature of (1.1)), establishing that high derivatives of the solution decay in time at least as 
fast as low derivatives so that technical issues are mainly, as expected, in decay rates and 
localization (small Floquet or small Fourier numbers) and not in regularity; 

• we give estimates useful to analyze the effect of a change of independent variables on our 
bounds. 

These preliminary issues were already present implicitly or explicitly in [29, 30]. 

2.1. Bloch decomposition. To begin our analysis of the stability of a fixed 1-periodic stationary 
solution U of (1.1), recall from above that linearizing the flow of (1.1) about U leads to the 
consideration of the 1-periodic coefficient linear evolution equation (1.10). From Floquet theory, 
one may guess that it would be desirable in analyzing this equation to decompose solutions as 
superpositions of functions having a given Floquet exponent £ € [— 7T, it], i.e., functions e l ^h(-) with 
h G L 2 ([0, l]) P erj as described in Remark 1.3. This may be accomplished using the Bloch transform. 
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Given a function g G L 2 (R), its Block decomposition, or inverse Bloch transform representation, 
is denned as 

(2.1) g(x)= ^ e* x g(Z,x)d£, 

J —IT 

where _ 

g(Z,x) :=J2e 27rikX 9(t + ^k), 
fcez 

and g(z) := ^ J R e~ %wz g{oj)doj is the Fourier transform of g. Note that for any £ G [— 7r, 7r], •) 
is a 1-periodic function, hence, as desired, e^'g(£, •) has Floquet exponent £. 

Letting ,8 : L 2 (R) — >■ L 2 ([— 7r, 7r]; L 2 er ([0, 1])), g g denote the Bloch transform, we readily see 
that for our given linearized operator L, defined in (1.10), and g G L 2 (R) we have B(Lg)(£,x) = 
Lg [<?(£, •)] (x), hence the associated Bloch operators may be viewed as operator-valued symbols 
under B, acting on L 2 er ([0, 1]). Similarly, from the identity B[e tL g) (£,x) = (e tL «g(£;, •)) (x), a 
consequence of (1.11), we find the Bloch solution formula for the periodic-coefficient operator L: 

tL\/\ I „i£x/tLc~ 



(2.2) ( S (t)g)(x):=(e^g)(x)= / e^e"^, ■))(*)#. 

In particular, we see that the Bloch transform B diagonalizes the periodic coefficient operator L in 
the same way that the Fourier transform diagonalizes constant-coefficient operators. 

Using the representation formula (2.2), bounds on the Bloch solution operator e * can be con- 
verted to bounds on the linearized solution operator e tL . To facilitate these bounds, we notice by 
the standard Parseval identity that the rescaled Bloch transform \/2ttB is an isometry on L 2 (1R), 
i.e. 

/ \B(g)(£,x)\ 2 dx di = 27r||5|| 2 2([ _^ ] . L2([0il])) . 

-7T JO 

More generally, by interpolating (2.3) with the triangle inequality, corresponding to the case 
q = 1 and p = oo below, we obtain the generalized Hausdorff- Young inequality ||<?||lp(ir) < 
I|3||l«([-*-,7t],i*([o,i])) for g < 2 < p and £ + ! = 1, which, by (2.1), yields for any 1-periodic 
functions g(£, ■) 

e^'<?(£, r m , < ll5ll^([-7r >7 r],LP([0,i])) for (/ < 2 < p and - + - = 1. 
It is from this convenient formulation that we will obtain our linear estimates. 

Remark 2.1. To keep technicalities as low as possible, we have indeed compelled ourselves to prove 
all our linear estimates using (2.4) alone. The only price to pay is that we are thus confined to 
high-norm estimates, in W k ' p , 2 < p < oo, as a consequence of which, due to the details of the 
nonlinear iteration estimates, bound (1.16) (i) (thus (1.19) (i)) is not expected to be sharp except 
for p = 2. With more work, but in the spirit of the present paper, we expect that one may actually 
prove a (1 -|-t)~ 1 / 2 ( 1 ~ 1 /p)~ 1 / 2 + T > decay (r/ > arbitrary). See Remarks 1.7 and 4.3 of [30] for further 
discussion in the somewhat simpler setting of the reaction-diffusion case. 

Remark 2.2. Another important property of the Bloch transform that we will use repeatedly 
throughout our proofs is that it is well-behaved with respect to 2-scale analysis of a slow modulation 
ansatz. Indeed if g is 1-periodic and h is slow, in the sense that the Fourier transform of h 
is supported in [— it, it], then B{gh){^,x) = g(x)h{^). If no slowness assumption is made but 
still g is 1-periodic, then there still holds B(gh)(£,x) = g(x)h(£,x). Mark that the fact that in 



20 



20 Here and elsewhere, we denote \\g\\L«([-Tr,n],LP([o,i])) ■= (/^ •)llip( [0 ,i]) d C) 
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general high frequencies of h are involved in the low Floquet number part of gh will add substantial 
difficulties to the linear analysis below. This nice property that the Bloch transform separates scales 
may also be used to perform two separate change of frames for fast and slow variables through 
g(£,x) !->■ e^ Cl *g(£,x — C2t). This feature was used in a crucial way in the analysis of reaction- 
diffusion systems carried out in [57], where the proof relied strongly on self-similar techniques such 
as renormalization, requiring the Whitham equation to be essentially reduced to a viscous Burgers 
equation by going into the frame of its characteristic velocity (called linear group velocity) while 
keeping the original equation in the co-moving frame of the wave (the one of the phase velocity). 
This elegant strategy appears to completely break down in the system case considered here, for 
which many linear group velocities are involved. 

2.2. Nonlinear perturbation equations. We now discuss how the introduction of a phase shift 
affects system (1.1). Following [29, 25], we introduce the perturbation variable 

(2.5) v(x,t) = u(x-ip(x,t),t) -U(x) 

where u(x,t) satisfies (1.1) and i[)(x,t) is a phase shift to be determined together with v. We recall 
the following representation convenient for nonlinear iteration and established in [29]. 

Lemma 2.3 ([29]). The nonlinear residual v and phase shift ip linked by (2.5) satisfy 

(2.6) (d t - L)(v + ^U X ) =M, with M :=d x Q + d x K + d t S, 
where 

(2.7) Q := -k [f(U + v)- f(U) - df(U)v) , 

(2.8) K:=-vfr + Pv x -^ + PU x -^Z-, 

i - ip x i - i>x 

and 

(2.9) S := w/> x . 

Remark 2.4. As noted in [29], the advantage of (2.5), u(-,t) = (U + v(-,t)) o *(•,£) (with 
*(■,£) = (Id — ^(^i)) -1 ) over ^e (probably more natural) choice u(-,t) = U o *(-,i) + v(-,t) 
suggested by (1.6), is that the phase shift enters the equation only through commutators between 
composition with ^ and differentiation, hence only gradients of ip appear in the source terms 
Q,1Z,S on the right-hand side of (2.6). By contrast, the corresponding terms for choice v would 
involve also terms of order |V'||V'a:| including the nondecaying phase ip itself, thus making the decay 
too slow for our nonlinear iteration to close. This observation is by now classical in the stability 
analysis of traveling patterns; see the related observations of [15] regarding stability of periodic 
reaction-diffusion waves, and [71, 72, 42, 23] and [65, 66, 67, 73] 21 for similar, earlier, observations 
in the context of viscous shock stability and bifurcation. 

However, notice also the more subtle aspect of the decomposition 2.6 that it groups within the 
term (dt — L)(ipU x ) appearing on the left-hand side the linear order source terms iptU x and ip x U xx 
that are individually too large to handle in our later nonlinear iteration; the term (dt — L)(tpU x ) 
is then used to cancel instantaneous phase-modulations arising in the solution of the linearized 
equations, as described just below. This approach to detecting nonlinear cancellation originates 
from the study of stability of viscous shock solutions of systems of parabolic conservation laws (see, 
e.g., [71, Eq. (2.30)], [72], and especially [23, Eq. (5.23), Cor. 5.4, p. 453]) and is fundamentally 
different from those introduced in the reaction-diffusion setting in [15, 57] based on normal forms 
and successive reductions/renormalizations, which, as discussed in Remark 2.2(h), appear unlikely 



21 Specifically, [65, Section 3] and [66, Section 2.2.1] (group invariance and uniqueness), [73, Section 3] (translation- 
invariant center-stable manifold), and [67, Theorem 2.2.0] (Nash-Moser uniqueness theorem). 

18 



to work in the present case. Indeed, we view this distinction as the key to the successful treatment 
in [29, 30] of nonlinear modulational stability in the presence of conserved quantities. 

Isolation of the phase. To motivate the more technical analysis of Section 3, we describe in infor- 
mal fashion the way that we determine the phase ip, separating out principal nonlinear behavior. 
Using Duhamel's formula together with (2.6), we can write an integral equation for v as 

(2.10) v(;t)+i/>(-,t)U x = e tL (v(-,0)+i;(;0)U x )+ f e^ L N{s)ds, 

J 

with initial data 0) = ho, v(-, 0) = uq o (Id — ho) — U. Noting, by the bounds of Section 3, that 
e tL = U x e n+ i ■ s p (t) + S(t), where s p (t) is some operator sending lR n -valued functions into R n+1 - 
valued functions 22 (e n +i denoting the (n+l)th standard basis element) and S(t) is a faster-decaying 
residual, that is, that the principal part of the linear solution operator is a linear phase-modulation 
consisting of e„+i • s p (t) times the instantaneous shift U x , we remove this principal part by defining 
implicitly 

(2.11) ip{t) ~ e n+1 • s p (t)«, 0) + V(-, Q)U X ) + f e n+1 • s p (t - s)N{s)ds, 

Jo 

where the ~ here indicates equality for t > 1. This gives an expression 

(2.12) v {t) ~ S(t)(v(-, 0) + V(-, 0)U X ) + f S{t - s)Af(s)ds 

Jo 

for v in which no s p (t) terms appear, closing the system in (ip,v). 

This simple prescription follows the principle that we should choose the nonlinear phase so as 
to remove from the linear description of the residual v all contributions representing linearized, 
or "instantaneous" phase modulation, at least away from the initial layer < t < 1. On this 
latter time interval, where we are constrained by the restriction ip{-, 0) = ho, we instead interpolate 
between the right-hand side of (2.11) and the initial data 0) = ho, as described in (3.11)-(3.13). 

2.3. Nonlinear damping estimate. To complement the linear bounds, established below, that 
form the core of the proof, we will use the following damping-type bound established by energy es- 
timate in [29], useful in controlling higher derivatives by lower ones, enabling us to close a nonlinear 
iteration with decay rates of the lower derivatives. 

Proposition 2.5 ([29]). Assuming (Hl)-(HS), there exist positive constants 9, C and e such that 
if v and ip solve (2.6) on [0, T] for some T > and 

sup \\(v,i> x )(t)\\ H K, R) + sup \\ip t (t)\\ H K-i m < e 
te[o,T] te[o,T] 

then, for all0<t<T, 
(2.13) 

IK*)II^ (K) < Ce- et \\v(0)\\ 2 HK{R) + C [ e- e ^ (\\v(s)\\ 2 L2(R) + \\^ t ,^ x ,^ xx )(s)f HK . 1{R) ) ds. 
Proof, (from [29]) Rewriting (2.6) as 

(2.14) (1 - Va>t - ~k 2 v xx = kcv x - MUx + v x ) ~ Hf(U + v)- f{U)) x + k {^f{U x + v x ) 

22 The first space is the one of [/-values, the second one is the one of modulation parameters (M, k). 
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( — iy Q 2 i y 

taking the L 2 (M) inner product against Yl?j=o r — > integrating by parts, and rearranging, we 



obtain 

j t \H 2 HK(m {t) < -e\\a? +1 v(t)\\h m + c (\Ht)\\% K{m + m,^,^ x )(t)\\ 2 H K- Hm ) , 

for some positive C and 9, so long as \\(v, v x , ipt, ipx, 4 ; xx)(t)\\H K - 1 (R) remains sufficiently small. 
Sobolev interpolation Hsil^-if ( R ) < a_1 ll^^sll^^) + a ll9 , llz / 2( IR - ) gives, then, for a > sufficiently 
large, 

j t \H 2 H K {R) (t) < -0\\v(t)\\ 2 HK{R) + C (||w(t)||£ a(R) + ||(^,^)(t)||^ (M) ) , 

from which (2.13) follows by Gronwall's inequality. See [29] for further details. □ 

Remark 2.6. It should be pointed out that one technical feature of our nonlinear iteration is 
the loss of derivatives of the nonlinear perturbation variable v, i.e. our iteration argument shall 
control L p norms of v in terms of H s norms of v and gradients of ip. This loss of derivatives 
is compensated by the above "nonlinear damping" estimate, which is due to the (here, total) 
parabolicity of the governing equations. Of course, here, other strategies would be available (such 
as maximal regularity). The advantage of the above energy estimate is that it generalizes to 
partially parabolic systems (through the introduction of Friedrichs symmetrizers with Kawashima 
compensators; see [33] for such an analysis in the context of the Saint- Venant system). In contrast, 
the fact that the Bloch transform involves only bounded Floquet numbers translates immediately 
into the fact that local parameters such as ip (or M) should be "slow," in the sense of Remark 2.2, 
so that arbitrarily many derivatives are gained at linear level. 

2.4. Inverse modulation bounds. An important technical detail [15, 25, 26], and a new issue 
beyond the viscous shock wave case mentioned in Remark 2.4, is the relation between quanti- 
ties \\u(- - ip(-,t),t) - [/ J ^+ JW "(-> t )M 1 -' ! M-,*))( . )|| iP ( R ) conveniently estimable by our analysis and 
the corresponding quantities ||it(-,i) — U M ^' t ^ K( -'' t \^(-,t))\\ LP ^ arising through formal Whitham 
approximation. With this in mind, we remark that the following (sharp) estimate shows that 
||F(-)-G(-+0(-))||lf(]r) is essentially equivalent to ||F(--</>(-))-G(-)||lp(r) phis ||</>||l°°(r) II<Mlp(r)- 

Lemma 2.7. Let (f> be bounded with \\4>x\\l°°(®i) < 1- Then Id — (j) is invertible and 

_. i 
\\F - Go (Id- (f>) \\lp(m) < (1 + \\<Px\\l™(r)) p \\Fo (Id- <t>)- G|| LP(K ) 

(2.15) \\F-Go(Id + 4>)\\ LPm < (l + U x \\ L oo m )v \\Fo(Id-<t>)-G\\ LPim 

+I|G , :t||l°°(r)(i + II0x||l°°(r)) p II</>IIl°°(r)II<Mlp(r)- 

Proof. By the implicit function theorem and boundedness of 4>, the map Id — <f> is invertible. Let 
us write its inverse Id + cb. Since the Jacobian of Id + <f> is bounded below by (1 + ||<Ax||l°°(r)) _1 , 
we remark that 

\\[ F o (Id - 4>) - G]o (Id + 4>)\\ LP{R) < (1 + Uxh-m)? \\f o (Id - 4>) - G\\ LP(R) 
and the first part of (2.15) follows. We then split F — G o (Id + (f>) as 

F -Go(Id + (f>) = [F o (Id- (/)) - G]o (Id + 4>) + Go (Id + 0) - Go (Id + (/)). 
Now the intermediate value theorem yields 

||Go (Id + 4>) - Go (Id + 0)|| LP(K) < ||G x || L oo (K) ||<^- 4>\\ lp( r). 
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But, from equality <fi = <fio (Id + (f) we infer 

(f>(x) — (p(x) = (f>(x) / 4> x (x + tcj)(x)) dt 
Jo 

from which Holder's inequality yields 

H-n p LP{m < MWloow f Q ||^o(id + ^)||^ (M) ^. 

This concludes the proof SinCG \\0 \\ T oo /"TQ>\ < ||0||x,oo (m) an d, for t £ [0, 1], Id + t(j> is invertible with a 
Jacobian bounded below by (1 + ||0x||i,°°(r)) -1 - See [26, Remark 1.4] for related comments. □ 

Remark 2.8. Note that the quantity u{- —ijj(-,t),t) estimated in our analysis does not necessarily 
control u(-,t) unless kip x is small in L°° (local invertibility of Id — tp) and bounded in L 1 (global 
invertibility) . Thus, our approach (and likewise that of [25, 26, 15, 57]) is inherently a small- 
variation analysis in wave number, whether or not the Whitham system admits large-variation 
solutions. 23 

However, if the Whitham system has an associated convex entropy [39, 64], then, for k — k ~ kip x 
initially small in L°° and bounded in L 1 (hence small in L 2 ), by the results of [22], it has a 
solution that remains small in L°°, and decays as (1 + i) _ 2( 1_1 /p) in all LP, whence, combining 
the stability and behavior arguments of Sections 3-4 and closing an iteration for z (the refined 
perturbation variable defined in (4.7)) instead of v, it might be possible to relax the assumption 
||V , x (0)|| i inL° o < 1 to IIV'zWIIl 1 = 0(1) and HY^O^U 00 <C 1, allowing large variations in phase. 
In the absence of a convex entropy, one might instead assume closeness of V'x(O) i n L l n H 8 to a 
special "diffusion wave" solution (in the scalar case, a distorted Gaussian obtained by Hopf-Cole 
transformation; in the system case, a more complicated coupled superposition of such waves [41]) 
that has evolved sufficiently long to be of small gradient, again relaxing slightly the restriction that 

be of small initial variation. These would be interesting directions for future investigation. 

3. Stability 

In this section, we prove Theorem 1.10. In particular, to separate further (2.10), we first need 
a precise spectral analysis (proving Proposition 1.7 along the way) that will allow for a separation 
of e tL into a part aligned with U x plus a faster-decaying term. Throughout this analysis, we shall 
often refer to algebraic relations obtained from the profile equation (1.3) and stored in Appendix A. 

3.1. Spectral analysis and nonlinear decomposition. 

Proof of Proposition 1.7. By (A. 3), one may choose {pM-JJ, . . . , dM n U, U') as a right basis for So, 
and (ei, . . . , e n , u adj ) as the dual left basis, where ej denotes the constant function equal to the 
jth standard Euclidean basis element, and u ad i denotes a generalized zero eigenfunction of the 
adjoint operator L* = Pd 2 + k(df(U) - c)d x such that (u adj ,d M U) = and ,V) = 1. By 
standard spectral perturbation theory [35] we may build spectral projectors and appropriately 
extend (locally near £ = 0) these dual bases in an analytic way into dual right and left bases 
{9j(0}j=i an d {qj(0}]=i associated to the spectrum of in some fixed neighborhood of the 
origin. This spectrum is then precisely the one of the matrix 

(3.i) A f = mo^MO))^, 

which we expand as £ — » as 

A f = Ao + (ifcOA (1) + (ifc0 2 A (2) +0(^ 3 ). 



^In particular, for the three example systems considered in the introduction, the Whitham system does have 
large-amplitude solutions for data merely bounded in L 1 n L 2 since it has an associated convex entropy; see [22]. 
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We expand also L f = L + (ik^L^ + (ik^) 2 L^; see (A.2) for definitions of the L®. 
Note that, replacing, if necessary, simultaneously q n +i{0 with 

n 
J'=l 

and, for j ^ n + 1, with (?j(£) + ^{qj(0),d^q n+ i(0)) q n+ i(£,), we may assume without loss of 

generality that, for j 7^ n + 1, 

<&(0),% n+ i(0)> = 0, (a^-(0),g n+ i(0)) = 0, 

the second inequality stemming from the first by expanding to first order in £ the duality re- 
lation (qj{0iQn+i(0) = 0- Now note also that since Ln</ n +i(0) = 0, expanding to first or- 
der in £ the fact that L^q n+ i(^) lies in := Span{<fy (£)}"+! the critical space of Lg, we find 
that Lod^q n+ i(0) + (ik)L^q n+ i(0) lies in So, the generalized kernel of Lq. Using now (A.3) (iii) 
yields that Lo(d^q n+ i(0) — (ik)dkU^^) lies also in the generalized kernel of Ln, thus so does 
d£q n +i(0)—(ik)dkU\(M,k)- Orthogonality relations from above and (A.3) (iii) lead then to d^q n+ i(0)— 
(ik)d k U m - k) eCU'. 

The above preparation yields the following representations for Ao and the first- and second-order 
correctors of the matrix A^: 



An 



Onxn n xl\ / n xn n xl 

(q n+1 (0),L oqi (0)) J ~ \d M u\ { M,k) 



A (2) 



A (i) = /^(^•(0),L ^(0)) + (g,(0),L( 1 ) % (0)) nxl 
1 " ^ * i(g„ +1 (0),^o%n+i(0)) + (g n+ i(0),L (1) fe+i(0)) / 

= (-([d M F)\ { M^)-clA) nxl \ 

V * -Hdkc)\ { M,k)J ' 

(* ^(^(0),L % n+ i(0)) + i(^(0),L( 1 )g n+1 (0))\ 

+i(g,(0),L«% n+1 (0)) + (^(0),LP), n+1 (0)) = * ^ ^ 

\* * y v 

Therefore, there is no loss of regularity in £ under the scaled similarity transformation 
and 

Aq = {-((dMF)\(M,k)-ad) -(dkF)\(M,k)\ 

V -KdMc)\ ( M~ k) -H d kC)\(M,k)J 

This gives the result in a straightforward way. For omitted details, we point to similar computations 
in [48, 49, 32]. Recall also that this result is proved at spectral level (i.e., not including matrix 
expansion or eigenvector information) by an Evans function approach in [62, 52]. □ 

The former proof provides more than stated in Proposition 1.7. We collect supplementary infor- 
mation in the following lemma. 

Proposition 3.1 ([49]). Assuming (Hl)-(HS) and (Dl)-(DS), there exist e > 0, £ G (0,7r), n + 1 
analytic curves, j = 1, . . . , n + 1, Xj : [— £0, £0] — > B(0, en) such that for £ G [— £0, £0] 

a(L € )n 5(0, £ ) = { Aj(0 I j G {l,...,n + l} } 
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and for 9 > as in (D2) 

(3.3) \ J (0 = -ika j £ + (ik) 2 b j + O(\ti\ 3 ), aj,bj real, ~k%>9, 



and associated left and right eigenf unctions 4>j{0 an d <f>j(0 satisfying pairing relations 

(3.4) (^(0, MO) = ikSl 1 < j, k < n + 1, 
obtained as 

n 

ho = m)J2Pi j) (OQi(o + $Uo Q n+ i(o 

(3.5) r l =1 

MO = EA a) (Offl(0 + ( fk 0^U0 Qn + i(0 

1=1 

where 

• (li(0> ■ ■ ■ > 9™+i(C)) tfid (<7i(£)) • • • ) <Zn+i(0) are ^ Ma ^ bases of spaces associated to the spec- 
trum of respectively and its adjoint L| in B(0,e ), analytic in £, bifurcating from 

(<9mi^, • • • , dM n U, U') and (e±, . . . , e n , u ad ^) at £ = 0, rat/i i/ie constant function equal to 
the jth standard Euclidean basis element and u adj a generalized zero eigenfunction of Lq 
such that (u ad ^ , 8mU) = and (u adj , U') = 1, and chosen such that 

(3.6) ($,-(0),%„+i(0)>=0, (0^(O), gn+ i(O))=O, l<J<n; 

• (/? (1) (0,...,/3 (n+1) (0) and (/3 (1) (0, • • • ,/5 (n+1) (0) are dual right and left eigenbases, an- 
alytic in £, o/ i/te matrix A^, defined in (3.1)-(3.2), associated to eigenvalues \j(£,)/(ik) 
and in particular for £ = i/iey /orm dua/ ng/ii and Ze/t eigenbases associated to ajs of 
d ( F >-u)\(M,k) - eld- 

Proof. This is a direct consequence of the proof of Proposition 1.7 except for the conditions on 
\j(0, a i> bj, which follow from (D2) and complex conjugate symmetry together with (H3). □ 

Remark 3.2. Scaling transform (3.2) is directly related to the fact that the equation for the time 
evolution of local wavenumber is obtained by differentiating once in space the equation for the local 
phase. Though unnecessary to prove Proposition 1.7 in the uncoupled case, since the proposition 
follows then already from an examination of A^ 1 ), the manipulation is correct regardless of the 
linear coupling assumption. Yet in the uncoupled case, we may also assume ^ n+1 \0) = and 

Pi+i(°)_= for 3 + n + !■ Th en by replacing 4> n +i(0 witn (^0 -1 ^n+i(0 and > for 3 + n + 1, 4>j(0 
with (ik£)~ 1 (f)j(£,), we obtain dual critical bases, analytic in £. For localized data this difference 
translates at the linear level into different decay rates. Note also that, even in the linearly uncoupled 
case, manipulations similar to scaling must be performed to get information about eigenf unctions. 

Next, following [30, 25], in view of (3.5) (i) with q n+ i(0) = U', we decompose the solution operator 
S(t) = e tL as 

ra+1 

(3.7) S(t) = S*{t) + S(t), S*(t) = U' e n+1 ■ s*(t), s*(t) = £ s^(t) 
with 

(3.8) (s*(t)g)(x) = |\^a(Oe^^^^^(6(^(C,-),5(e,-))L 2 ([o ! i])^, 
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and 
(3.9) 



(S{t)g)(x) := ^ e**(l-a(0)(e tL «S(0)(*R + f e* x a{0{e tL *IL(Og(Q)(x)dt 

J— 7T ./— 7T 



where a is a smooth cutoff function such that < a < 1, a(£) = 1 for |£| < £o/2 and a(£) = for 
|£| > Co, and 

n+l 1 n+l 

(3.10) ip(o -^qjiOm&Mio,!]) = tttE M0(M0, Mm)> fi ® :=Id - nP (6 

denote respectively the eigenprojection, defined for |£| < £o 5 onto the critical space 

£ e = Span^)}^ 1 

bifurcating from So at £ = 0, and its complementary projection. 

To establish nonlinear stability, there are of course other natural splitting choices available. In 
particular, the fact that we have kept in (3.8) the full /3^(£) instead of /3^(0) alone will play a 
role only in the asymptotic behavior study. 

Next, we transpose this linear decomposition to the nonlinear level. Recalling Lemma 2.3 and 
integral equation (2.10), we start with the equation 

(d t - L)(v + tpU') = M, v\ t =o = d , ip\ t =o = h , 

for the nonlinear residual v(x,t) and the phase shift ip(x,t) both introduced in (2.5), where do '■= 
M- ~ MO) ~U € L\R) n H K (R), d x h € L^R) n H K (R), and notice, as in (2.10), that, after 
denoting solution operator S(t) := e tL , an application of Duhamel's formula leads to 

(3.11) v(t) + ip(t)U' = S(t)(do + h U')+ I S{t- s)N{s)ds. 

Jo 

In order to simultaneously accommodate the initial datum constraint i/}(0) = ho and absorb as 
much as possible e n+ i ■ s p (t) contributions into the equation for ip, as described in the discussion 
surrounding (2.11)-(2.12), we split (3.11) as 



(3.12) 



and 



(3.13) 



iP{t) = e n+1 • s p {t)(hoU' + d )+ [ e n+ i • s p (t - s)M(s)ds 

Jo 



- (1 - X(t)) (e n+l ■ s p {t)(d + hot)') - ho + J e n+1 • s p (t - s)M(s)ds^ , 

v (t) = S(t){d + h U') + f S(t - s)M(s)ds 

Jo 

+ (1 - x (t)) (s p (t)d + (S p (t) - U)(h U') + J S p (t - s)M(s)ds) , 



where x(t) is a smooth cutoff that is zero for £ < 1/2 and one for t > 1. We may extract from 

(3.12) -(3.13) a closed system in (v,^ x ,if) t ) (and some of their derivatives), and then recover ijj 
through the slaved equation (3.12). 

We proceed stating now the linear estimates needed to bound the terms appearing in (3.12)- 

(3.13) . 
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3.2. Basic linear estimates. 



Proposition 3.3. Under assumptions (Hl)-(HS) and (D1)-(D3), for all t > 0, 2 < p < oo, and 
any I, m > 0, r > 1, 1 < j < n + 1, 



d l x d?s*{t)g 



< 



Lp(R) 



mm 



| (1 + t) -^(l/2-l/ P )+i- ^ 



(3.14) 
w/ien i + m > 1, 

(3-15) || sj(t)£? ||l°°(R) ^ lbllLi(R)nL 2 (R) 

w/ien (I = m = and p = oo), for some r] > 



L 1 (M) 
5 ^lbllL2(R) 



(3-16) \\e n +i- s p (t)g\\ LP{R) < min 

w/ien (I = m = and 2 < p < oo), and 



+ 



(3.17) 



< 



mm 



■ (1 + i) -i(i-i/ P )-^P||^ 
(1 + £ )-|(V2-l/p)-to| 



while, for some rj > 0, < I + 2m < K + 1, and 2 < p < oo, 



(3.18) 



and, for 1 < r < K + 1, 
(3.19) & x d?S{tW x g 



LP(R) 



< min 



< min 



^n^n^ +2m+1(R) + (i +r^ i - i/p) ii5iiL 1( R) 



-»/t||«r 



^IIh'+2™+1(R) + (1 + *) 2 



i(l/2-l/p)i 



Il 2 (r) 



e ^ll^lln'+^ifR) + (! + *)" 2Vi ^ 2 lltfllLi(iR) 
e-*\\d x g\\ Hl+2m+lm + (1 + t)-|(V2-i/p)-| || 5 || L2(R) 



Proo/. (i) (Proof of (3.14);. First, notice that 



(3.20) (d l x d?s p Jt)g)(x) 



\(Oe A ^^^(^ i A^O m /3 ' ) (e)(^(0,5(0)L 2 ([o,i])^. 



In the case I + m > 1, estimates on s p follows from, choosing either s = 1 or s = 2 fixed and 
introducing s' such that 1/s + 1/s' = 1, the generalized Hausdorff- Young inequality (2.4) by 



(3.21) 



x _> f W e ^ a{0 e^0t 1 (,O'A,(O m /3 a) (O(^(O,5(O)L 2 ([0,i])^ 
< II «(Oe A(5)t |er +m ' 1 K^(0,5(6)L 2 ([o,i])l IU«([- Wlff ] 1 LP([o,i])) 



< \\t* IZr™-^ *|| LP( .„ ([ _ >W]) I!? ^ a(0 1/2 K^(0,5(0)L 2 ([o,i])lll^( h ^]) 

< (i +t ) -i(v-iM-^r i || C _> a (o 1/2 K^(o,5(0)L 2 ([o,i])iii^( [ - 7r , 7r] ), 

where 1/p + 1/g = 1 and 1/s' + l/r(s,p) = 1/q, so that l/r(s,p) = 1/s — l/p. Here we have used 
(D2) to get for some rj > 0, {e^^a 1 / 2 ^ < e'^ 1 and (1.12) to get = 0(£)- 

Now, for s = 2, we note by the Cauchy-Schwarz inequality and Parseval identity (2.3) that 



< sup ||^-(OIIl2([o,i]) Ml^r), 

ICI<& 
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where £o is given as in Proposition 3.1. For s = 1 on the other hand, we begin by expanding 

<<M6,s(6>L3([o,i]) = £<tej')*s(£ + 2A), 

where (f>j(£,f) denotes the j'th Fourier coefficient in the Fourier expansion of 27r-periodic function 
(f)j(-), and z* = z denotes complex conjugate. Applying the standard Hausdorff- Young inequality 
for the Fourier transform, i.e. ||<7||z,oo(]r) < ||5||li(r), together with the estimate 



« 1/2 (o E \*i(ur\ < « 1/2 (o /E^ + mMuw D 1 + i^i~ 2 ) ^ ca^(mMo\\m m]) , 

f V i' i' 

which readily follows from the Cauchy-Schwarz' inequality, we obtain the bound 

a(O 1/2 |<0j(O.5(O)L 2 ([o,i])IIU o °([-T,7r]) ^ sup ||^j(OIIhi([o,i]) HfllLi(M)- 

l£l<£o 

Together with (3.21), this establishes (3.14). 

(ii) (Proof of (3.15) ). Thanks to (3.3), with the same kind of estimates as above one can bound 
in L°°(R) the difference between s p -{t)(g) and 



x 



e (-ik aj s-k> bj e)t e iZx 1 /3^)(0)(^(0),5(0)l 2 ([o,i])^ 

J— TV l &c, 



by C(l + t)~2|| 5 ,|| Ll(R) . Since <^-(0) is constant equal to v i := 0i\o), . . . ,(3^(0)), 

(^(o),5(6)l 2 ([o,i]) = "j-diO 

and the last term is recognized to be the convolution of Vj ■ g with 

x ^ p. v . [" e (-ia j (-Pi i at^^^(j)/ () ^ = pji f ei-ikait-m^tjtx^pij),^ 

J-TT lK JR lk£ 

- [ e (-ifca J -€-* 2 6i€ 2 )* e *€x ^L^C?) (0)de- 

7R\[-7r,ir] 

By using the Cauchy-Schwarz inequality, we may bound the last integral in L 2 (IR) with Ce~ vt , for 
some 7] > 0. The remaining principal value is explicitly computed as a Gaussian error function 

S>)(0) errfn [ 

and thus is bounded in L°°(R). This achieves the proof of (3.15). 

(in) (Proof of (3.16),). We first remark that one can bound in L P (R) the difference between 
e n+ i • s p (t)(g) and 

x— ► T e^^ e (^?-^ 2 )*i- en+1 .^)(0) v r g(0d£ 

J-TT - = 1 

by C(l + i) _ 2( 1_1 /p)||(yr|| L i ( - R - ) . Since {/3j(0)}j and {(3j(0)}j are dual bases, it follows, with Uj still 
denoting the constant value of 4>j(0), that 

n+l n+l 



J'=l J'=l 
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Iddxd\ /Iddxd 
0---0/ V0---0 



and therefore 



n+l 



V 3 







0). 



e n+1 .^/3^)(0) 
j'=i 

Thus, the quantity to bound may be written as the convolution of g with the kernel 



/7T " 

Observing that 



"< e (-ika^-k 2 bje)t _ e (-ifca 3+1 C-fc 2 6 J+ iC 2 )t 



e (-ikaj£-k 2 bj€ 2 )t _ e (-ika j+1 £-k 2 b j+1 £ 2 )t 



provides (3.16) (i). 

To get (3.16) (ii), we first write the kernel (3.22) above as 



< t 



f J 1 j2 e ^H(^ i±i ^^ i±i ) t ) t 



X 



Oi — a 



'i+i 



+ ik^ibj - bj + i)r 



x e 



!>,■-!>,■ 



Integration by parts in the £ variable shows that for any fixed e > the contribution to the L p 
norm of the kernel of points x lying outside 



n 



\l<j<n+l 



min cij — e ) , t max ou + e 



l<j<n+l 



is bounded by 



et V 



i/p 



< at 1 /?. 



Following the end of the proof of (3.15), we may write the remaining part of the convolution as the 
convolution of g with a term decaying exponentially in time in L 2 (R) plus its convolution with a 
bounded function supported on If. This yields (3.16) (ii). 

(iv) (Proof of (3.17) ). The proof goes similarly as the one of (3.14) thanks to the fact that since 
4>j(0) is constant, for r > 1, the equality 

<<MO,(W(0>L 2 ([o,i]) = £ (l)(iO r, (4>j(0,d r x - r ' 9(0)^10,1]) 



a-i) r (d r x 



r'=0 x ' 

' MO -Mo) 



holds and provides the factor £ needed to compensate for (ik£) 1 . 

(v) (Proof of (3.18) and (3.19) ). The last part of (3.9) is bounded as sj, with an extra £ factor 
compensating for {ik^)~ l thus enhancing decay and allowing for / = m = and 2 < p < oo. We 
focus on the two remaining terms of (3.9). 
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To treat these, it is convenient to introduce on H l ([0, 1]) a family of equivalent norms parametrized 
by £ G [-vr,vr], 

I 

Hf llm ([ o,i]) : =EH^ + ^lli 2 ao,i]) 

i=o 

so that Parseval's identity implies 

Ibll^(R) = (2?r) ||€ •-»• I|S(0 ll^([o,i])) ll^d-^])- 

Now, thanks to standard resolvent bounds [21], assumptions (D1)-(D3) and the fact that H l+2m+1 ([0, 1]) 
and L 2 ([0, 1]) spectra coincide [19], we may use Priiss' Theorem [56] and obtain that for some rj > 

|e* L «(l - a(0)\ ti i+2 m +i m]) ^ ti i+2 m +i m]) , \a(£)e tL z U(0\H^+^[o,i])^H l + 2m+1 ([o,i]) ~ 

Therefore 



J —IT 

J — 7T 



^||£^ ||p(0llH' +2m+1 ([0,l])) IU 2 ([-7r,7r]) ^ e ^Hfllff'+^+iQR)- 

Since, for 2 < p < oo, # i+1 (IR) is embedded in W l,p (R), this completes the proof of the proposition. 

□ 

Remark 3.4. We have included (3.16) (ii) to give a better account of large-time behavior of the 
phase. Yet, in our nonlinear stability analysis we use (3.16) only to ensure that the initial time 
layer remains localized, thus only for intermediate times 1/2 < t < 1. For this purpose, (3.16) (i) is 
sufficient. The proof of the latter estimate is easier to obtain and involves only frequency arguments 
in the spirit of the rest of the paper. 

3.3. Linear modulation bounds. The above basic linear estimates are sufficient to control all 
terms in (3.12) and (3.13) in the localized case when ho = 0. To control the additional terms arising 
from the nonlocalized initial phase shift hoU', we use the bounds in the following proposition. Here 
and elsewhere, we suppress the dependence of bounds on norms of the background periodic wave 
U, the periodic right and left eigenbases 4>j an d <j>j, or other known periodic functions, which, by 
(HI), may be seen to be as smooth as needed for the H s ([0, l]) per bounds our arguments require. 

For functions h with localized derivative, we will use repeatedly h(£) = jfd x h(£). From now 
on, to make this possible, we will assume that all such functions are centered, in the sense that 
h(—oo) = —h(oo) (which includes the case where h is itself localized and both terms vanish). In the 
statements of Theorems 1.10 and 1.12, it corresponds to the assumption that the global phase shift 
ipoo is 0. Of course, in doing so, one does not lose in generality since this is achieved by replacing 
U, ho, ip, etc. with U( ■ - ipoo), h - ip^, ip - ip,*,, etc. 

Proposition 3.5. Under (Hl)-(HS) and (Dl)-(DS), for all t > 0, 2 < p < oo, I, m > 0, 

1 < j < n + 1, 

(3.23) \\d l x drsV(t)(h U')\\ LPm < (l + t)-^-Vr)H- l -^\\ dx ho\\ Lim 

when I + m > 1, 

(3-24) || S?{t)(h U') 1 1 loo (k) < 1 1 5a; /to 1 1 L 1 (]R)nL 2 (M) 
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when (I = m = and p = oo ), 

- f i 
(3-25) || e n+ i • s p (t)(/i C7 ) - h \\ L p(r) < (1 + t* 1 ) \\d x h \\ L i(M.)nL 2 (R) 

when (I = m = and 2 < p < oo), and, for < I + 2m < if + 1, 

(3.26) \\d l x d?S(t)(h U')\\ LPm < (l + t)-^-yp)\\d x h \\ LlmnHlMlm . 

Proof. 24 (i) (Proof of (3.23),). This follows applying the arguments of the proof of (3.14) in 
Proposition 3.3, once we have established the estimate 

||^^Q(0 1 / 2 |(^(0,(/»0^ , )''(0)L»([0 ) 1])IIU«»([- W)1 r]) 



< 



sup ||^0j(OC/'||li([o,i]) + sup ||^j(0^'I|l2([o,i]) II^xMlM 



The latter bound stems from first re-expressing 

«(0 1/2 ^(e),(/ l of/ / )^(0)L 2( [o,i]) = -ia(o 1/2 ( ^ (0 :^ (0) ,^) WO 

\ / L2([0,l]) 

where {<j>jU')(£,j') denotes the j'th Fourier coefficient in the Fourier expansion of periodic function 
</>j (£){/', then applying a Haussdorff- Young estimate, the Mean Value Theorem, Cauchy-Schwarz' 
inequality, and Parseval's identity. 

(ii) (Proof of (3.24) ). Thanks to (3.3), since (j>j(0) is constant equal to Vj, with the same kind 
of estimates one can bound in L°°(R) the difference between s?(t)(hoU') and 

J —IT 

by C(l + ||5 x /io||ii(R)- Now note that, for £ € [— vr,7r] 
<[/', [ho(0 - MO])l2([o,i]) = -(U,d x (h )(0)m[ ,i]) 

= -(U, (d x h )(0) L 2([o,i]) + i^djToiO + [MO - MO])lH[o,i]) 
= -{(Ud x h ) - (0) + (U)^ho(O + [MO - MO])l»([o,i]) 

leading to 

(3.27) (u'Mo - MOD = -[{u- (u))d x h r(0 + m, [MO - M6])^ ([ o,i]). 

Again the extra £ factor makes the contribution of the last term negligible so that we are left with 
proving a L°°(R) bound on 

x ^ r e^^^^e^i-^^^hi^^-CO), E/V([o,i])^o - ^ ■ (tf - {U))d x hor 

J —IT 

Then the proof of (3.24) is achieved as was the one of (3.15), writing the main contribution as a 
term exponentially-decaying in time plus a convolution with an explicit errorfunction. 25 



24 Compare to the similar but much simpler argument of [25, Proposition 4.1], in the reaction-diffusion case. 
25 Up to the explicit computation of the final convolution kernel, the arguments of the proofs of (3.15)-(3.16) and 
(3.24)-(3.25) are the ones refined to obtain (4.24). 
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(Hi) (Proof of (3.25);. The L°°(M) bound follows from (3.24) and ||/io|U°°(R) < \\d x h \\ L i m . By 
interpolation we only need now the L 2 (R) bound. 
Since || £ i — > l^" 1 ||L2( K \[_ 7ri7r ]) < 1, 



ho(x) - r e* x MOdli 

J —IT 



< 



L 2 (R) 



\\d x ho\\ 



Combing this with arguments of the proof of (3.24), we bound in L 2 (R) the difference between 
e n+1 • s p (t)(h U') - h and 



x i-+ 



/7T 
e* x 
-It 



n+1 



n+1 



/7T "T-l 

3=1 



da 



by C||^^o||Li(R)nL 2 (R)- I n the sum the latter term is bounded following the proof of (3.16) (ii). To 
bound the first term, we first observe that, thanks to (3.6), 



(d^(0),£/') i2([0)1]) = (<%<^(0),g n+ i(0)) i2([0)1]) 



«fc©i(0) = ifc/P(0) 



e n +i 



therefore 

n+1 

ik 



3=1 



1 n+1 n+1 

i=i 



2n+i — e n+ i • e n+ i — 1. 



Then the function to bound is written 



x i-+ 



n n+1 t-ika j £,-k 2 b j £, 2 )t _ i 



/7f ft-T-L 
7=1 



tf +1 (o)^ +1 (o) fc^m 



and is bounded in L 2 (R) splitting 



e (-ikaj£-k 2 bj£ 2 )t _ i e (-ikaj£-k 2 bj£ 2 )t _ e -k 2 bj£ 2 t e -k 2 b^ 2 t _ ^ 



+ 



and using both 
with 



e -*V 2 *-i 



ik£ 



n ■■- 



< y/i and the arguments of the proof of (3.24) (ii) where If is replaced 



1 (i#<^+i (a ^-- £ 



' 1 imnJ a ' )++£ 



(_ and + denoting negative and positive parts). 



(iv) (Proof of (3.26) ). The last part of (3.9) is bounded as was s?, with the usual improvement 
in integrability and decay coming from the extra £ factor. Once we have proved 



lie ^ \\u'[h (O - MO]llfl£+*»+i([ 0) i])) 
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Il 2 ([-7t,7t]) < 11^x^0 1| ffi+2m+l( K ), 



the contribution of the high frequencies of ho to the remaining terms is estimated following the 
proof of (3.18). Since, for £ G [— ir,ir] there holds 

\\U'[h (O - MO]\\hI+^+\[o,i])) < E ll e2wJ "" ^'ll^+^([o,i]))IM^ + 27r /)l 

< (1 + Ht/'H^+iuo,!]))) + 2vrjY +2m+1 N(e + 2tt/)| 

< /El(^"^M^ + 2 7 riO| 2 , 

V 3'*° 

by squaring and integrating we obtain the needed bound. 

To deal with low-frequency contributions, we use the following refinements: 

|e* L «(l - a(0)l^+2™+i([o,i])^^+ 2 ™+i( [0 ,i]) ~ \^ e ~ Vt 

H0 1/2 (n(£) - n(o))| i ji + 2m+i ([0il]) ^. i ji+2m+i ([01]) < |£| 

(for some r? > and all £ G [-7r,7r]). Then, since U(0)U' = 0, 

a(0 1/2 mu' = «(O 1/2 n(6(n(O-n(0))c7', 

and, by following the proof of (3.18), we reduce the bound on the remaining terms to 

II? ^ ll^'llH^ +2m+1 ([0,l])) l£IIMf)l llL 2 (h7r,7r]) i$ II^^O ||l 2 (R)- 

□ 

Remark 3.6. Unlike what occurs in the linearly decoupled case, linear bounds for a localized initial 
datum do G L 1 (M) or a nonlocalized one of shift type hoU', d x ho € L 1 (M) provide in general the 
same decay rates. As a consequence, once these bounds are proved, the proof of nonlinear stability 
is identical to the one for localized perturbations [30]. In particular, the slow decay rate due to the 
Jordan block is compensated for nonlinear terms by the fact that they come in flux form. 

3.4. Nonlinear stability. From differential equation (2.6) together with integral equation (3.12)- 
(3.13), we readily obtain short-time existence, uniqueness and continuity with respect to t of solution 
(v,ipt,4>x) G H K (M) x H K ~ 1 (M) x H K (R) by a standard contraction-mapping argument treating 
most of the terms as sources in a heat equation. Associated with this solution define so long as it 
is finite 

(3.28) C(*) -=supJ\(v,ij t ,^ x )(s)\\ H K mxH K-i mxH K m (l + s) l/4 . 

Combining linear estimates with Proposition 2.5, we now prove an inequality for ( that will yield 
global existence of our solutions. 

Lemma 3.7. Under assumptions (H1)-(H3) and (D1)-(D3), there exist positive constants C and 
e such that if (do, d x ho) is such that for some T > 

E := || (d , d x h )\\ L i {K)nH K (R) < e and ((T) < e 

then, for all <t <T, 

C(t)<C(Eo + ((t) 2 ). 
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Proof. 26 By (2.7)-(2.9) and corresponding bounds on the derivatives together with definition (3.28) 
and equation (2.14) (used to bound v t ), 

W(t)\\^(R)nH^R) ^ \\(v,v x ,v xx ,ilH,ip x ,ip xx )(t)\\ 2 Hl QQ <C({t) 2 (l + ty^, 

(3.29) \\(Q,n,S)(t)\\ LHmm{R) < \\(v,v x ,i: t ,i; x )(t)\\ 2 Hl{R) <Cat) 2 (l+t)-K 

so long as ((t) remains small. Applying the bounds (3.14) (i)— (3.19) (i) and (3.23)-(3.26) of Propo- 
sitions 3.3 and 3.5 to system (3.12)-(3.13), we obtain for any 2 < p < oo 

rt 



(3.30) 



\W)\\v>m < c(i + tyte-WEo + cat? [ (i + t- s )-3(i-vp)-J(i + s) -* da 

■/ 

< Cp (E + at) 2 )(i+t)-^-yp) 

and, with 

(3.31) f s p (t-s)d t S(s)ds = - f d t [s p }{t-s)S{s)ds + s p {0)S(t)-s p {t)S(0), 
Jo Jo 

\m,Mxx)(t)\\ W K-i, Pm <c(i+t)-^ (1 - 1/p) (^ + cw 2 ) 

(3.32) + CC(t) 2 Al + t - s )-3( 1 -Vp)-i/2( 1 + s) -| ds 

JO 

<C p (£ + C(i) 2 )(l + ^ (1 ~ 1/p) - 
Since the hypotheses of Proposition 2.5 are verified, from (2.13) and (3.30)-(3.32), we thus obtain 

(3.33) \\v(t)\\HK(m)<C(E + ((t) 2 )(l + t)- l i. 

Combining this with (3.32) for p = 2, we obtain the result. □ 

We are now ready to prove Theorem 1.10. 

Proof of Theorem 1.10. As already mentioned, short-time existence and uniqueness ensuring con- 
tinuously in time {y^ipti^x) £ H (M.) x H K ~ l (R) x H K (R) are proved in a standard way. There- 
fore, by Lemma 3.7 it follows by continuous induction that solutions are global in time and satisfy 
C(t) < 2CE for t > 0, if E < min({l/4C 2 , e, e/2C}), yielding by (3.28) the result (1.13) for p = 2. 

For any p* < oo, applying (3.30)-(3.32), we obtain (1.13) for 2 < p < p* with a uniform constant 
C. Now rewrite (2.8) as 1Z = 1Z\ + d x lZ2 with 

Ul ■= _ k 2 v ( -^—] + k 2 U x -^, U 2 := k 2 v-_ ^ 



, 1 - ip x J x i ~ l 'v 

Taking p* > 4 and estimating 

(3.34) \\(Q,n 1 ,n 2 ,S)(t)\\ L 2 {R) < \\(v^ t ^ x ^ xx )(t)\\ 2 L4m <CE (l + t)--i 

in place of the weaker (3.29), then applying to integral terms (3.17) (ii) in place of (3.17) (i), we 

obtain, bounding again the dfS contribution using (3.31), 

(3.35) 

\\{Mx^xx){t)\\wK-^iK) < C(l + ty^-^Ev + CE 2 f\l + t - s )-|(i/2-i/p)-i/2(i + S )-Us 

Jo 

< cE (i + tr* (1_1/p) > 
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Compare to the argument of [29, Lemma 4.2], regarding localized perturbations in the decoupled case. 
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for 2 < p < oo. Likewise, using (3.34) together with bound 

||((2,Wl)(t)||H*-i(R) + 11(^2, «S)(t)||H*(R) <^o 2 (l + 0~^ 

obtained from the bound on and 

/ S(t - s)d t S(s)ds = - [ d t [S](t - s)S(s)ds + S{0)S(t) - S(t)S(0), 
Jo Jo 

instead of (3.31), we may use (3. 19) (ii) rather than (3.19) (i) to get, provided / + 2m < K — 3, 

\\d l x dTv{t)\\ LPm KCll + tr^-^Eo + CEl f e-^- s \\ + s)-^ds 

Jo 

( 3 - 36 ) +CEl /**(l + i- S )-5(V2-i/p)-| (1 + s) -I ds 

Jo 

< cEoii + ty^-vp) 

and achieve the proof of (1.13) for 2 < p < oo. 

Estimate (1.14) then follows through (3.12) using (3.17) (i) and (3.23), by 

„t 

\\ip{t)\\L°°(R) < CEq + CEI / (l + t-a)-3(l + s)-3da < C£ , 

Jo 

yielding nonlinear stability in L°° since 

u(x, t) - U(x) = u(x, t) - U(x + ijj(x, t)) + U(x + ip(x, t),t) - U(x) 
so that, by Lemma 2.7, 

\\u(t) - U\\ L ooag>) < \\v(t)\\ L oom + \\$(t)\\ L oom)\\$x(t)\\ L ™(R) + ||V'(*)IIl<»(R)Sup|C^'| . 

[0,1] 

Finally, Lemma 2.7 provides (1.15). □ 

In proving Theorem 1.10, we actually got or could get more estimates than announced. Since 
we need these extra estimates to prove Theorem 1.12, we record here for later these other bounds. 

Proposition 3.8. Under the assumptions of Theorem 1.10 and with its notations, for 2 < p < oo 
and t > 0, 

Ht)\\ LPm <Eo(l + t)-^ l - l M 

(3-37) ||^r^)|| L ,(M)<^o(l + r^ 

\\v(t)\\ H K m <E (l + t)-^, 
when 1 < I + 2m < K — 3 and when 3 < I + m and I + 2m < K + 1 

||(^,^)WIU, W <^o(l + t)-^ (1 - 1/p) 
(3-38) || (At^t x ^ xx )(t)\\ LP (R) < E ln(2 + t)(l + i)~ ! 

||^rV(t)llL,(R)<^o(l + i)-i 

where v is as in (2.5). 

Proof. Estimate (3.37) (i) is (1.13) (i) and (3.37) (hi) follows from the bound on (. Estimate (3.37) (ii) 
is proved incorporating extra decay obtained from differentiation in (3.36). 

Estimate (3.38) (i) is (1.13) (ii). Bounds (3.38) (ii)-(iii) follow along the lines of (3.35) benefiting 
from extra decay provided by differentiation. □ 
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4. Behavior 

The first step in going from Theorem 1.10 to Theorem 1.12 is to obtain a refined expansion of 
the solution operator S(t) = e tL , allowing us to split further the integral equation (3.11). This 
proceeds by refining the spectral expansions from which they ultimately derive. 

4.1. Refined spectral expansion and decompositions. 

Lemma 4.1 ([48, 49]). Assuming (Hl)-(HS), we may choose a parametrization in such a way that, 
for the quantities involved in (3.5), 

(4.1) %n+i(0) = ikd k U\ { M,ky 

Proof. We have already observed that the proof of Proposition 1.7 provides 

£o(%n+i(0) - ikd k U^,k)) G s o- 

The latter point implies that d^q n+ i(0) — ikd k Uu^ k \ G So- Therefore, we only need to show that, 
for 1 < j < n + 1, (<?j(0),% n+ i(0)) = (qj(0),ikd k U KIij - k) ). 

Moreover, the proof of Proposition 1.7 also yields (qj{0),d^q n+ i(0)) = for all j /n + 1. 
We may assume that this relation holds for j = n + 1 by normalizing q n +i(0 according to 
{q n+ i(0), g n+ i(£)) = 1. Indeed, once this is done, expanding to first order in £ this normaliza- 
tion and the duality relation {q~n+i(€),Qn+i(€)) = 1 provides the desired cancellation. 

Now from the second part of (A.3)(iii), we already know (qj(0),ikd k U^^) = for all j / (n+1). 
Moreover, up to changing parametrization by a /c-dependent shift, we may add to d k U^M,k) a 
suitable multiple of U' and get 

(4.2) (q n+1 (0),fkd k U m - k) ) = 0. 

This yields (4.1). □ 
Accordingly, following [26], we refine (3.7) and re-express S(t) as 

(4.3) S(t) = R p (t) + R M (t) + R(t), 
where 

(4.4) R p (t) = (U' + d k U kd x ) e n+1 • s p (t), 
with s p as in (3.8), 

n+1 

(4.5) R M (t) := d M U ■ s M (t), s M (t) := £ sf (t), sf{t) := | Id n : | kd x ^{t), 

3=1 



and 



(R(t)g)(x) 



f e**(l - a(0)(e tL tg(0)(x)d{; + f e ^0(^n(6s(0)(*K 

J —TT J —TT 

(4.6) n+i 6 quad (f x) 



with 



^ ad (e,x) = ^-(£,x) - (iioE^K)8(o,j) - tf+i(0(?n+i(o,x) + £% n+1 (o,x)), 

;=1 



where a is an already introduced smooth cutoff supported where |£| < £o> n(£) is, as defined in 



(3.10), the complementary projection of n p (£), the spectral projection on := Span{^(£)}"' +J 
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With (4.4)-(4.5) in mind, for u satisfying (1.1), we may refine the nonlinear decomposition (2.5) 
into 



(4.7) 



z(x, t) = u(x - ip(x, t),t)- U(x) - d k U(x) kip x {x, t) - d M U(x) ■ M(x, t) 
= v(x,t) — d k U(x) ki/} x (x,t) — 8mU{x) ■ M(x,t) 



where ip(x, t) still satisfies (3.12), and M(x, t) is defined through (4.8) (iii) just below. Further, recall 
do := uo(- — ^o(")) — U, and let \ be the smooth cutoff function of (3.12)-(3.13), with x(t) = for 
t < 1/2, and x(t) = 1 for t > 1. With definitions (4.3)-(4.5), we have the following lemma. 

Lemma 4.2. For M as in (2.6) -(2.9), the nonlinear residual z defined in (4.7) satisfies 

z (t) = R(t)(d + hoU') + f R(t-s)M(s)ds 

Jo 

+ (1 - X (t))(R p (t)(d + h U') - h U' - d k Ukd x h + J R p (t - s)M(s)d s y 
(4.8) ^(t) = e n+1 • s p (t){d + h U') + I e n+1 ■ s p {t - s)M(s)ds 



(4.9) 



- (1 - X (t)) (e„+i • s p (t){d + hoU') - h + J e n+1 ■ s p (t - s)AA(s)ds) , 

M(t) = s M {t)(d + hot)') + f s M {t - s)N{s)ds. 

Jo 

Proof. Recall, (3.12)-(3.13), that we have chosen tp so that tp and v satisfy both (4.8) (ii) and 

v (t) = S(t)(do + h U')+ [ S{t-s)M{s)ds 

Jo 

+ (1 - X (t))(u'e n+1 ■ s p (t)(d + hoU') - h U' + J U'e n+1 ■ s p (t - s)M(s)ds), 

where S(t) = S(t) - U'e n+1 ■ s p (t). Using now z(t) = v(t) - d k Uk^ x {t) - d M U ■ M{t), 

R(t) = S{t) - d k Ukd x e n+1 ■ s p (t) - d M U ■ s M (t), 

and R p (t) = (U' + d k Ukd x )e n+1 ■ s p (t), R M (t) = d M U • s M {t), equation (4.8)(i) follows from 
(4.8)(ii)-(iii) and (4.9). □ 

We now establish the refined linear bounds needed to estimate the terms involved in (4.8) 
4.2. Refined linear stability estimates. 

4.2.1. Refined basic estimates. 

Proposition 4.3. Under assumptions (H1)-(H3) and (Dl)-(DS), for all t > 0, 1 < j < n, 

2 < p < oo, and for some rj > 0, < I + 2m < K + 1, and 2 < p < oo, 



d l x dTsf(t)g 



< 



mm 



(4.10) 



d l x dTR{t)g 



< min 



\l + t)-fr- 1 M-*r\\ghw 

.(i+r^ (1/2 - 1/p) -^Ni, 2w 

e~ vt \\g\\H^m+i(u) + (1 + ty^ 1 ~ 1/p) - 1 i\\g\\ L i (R) 
e-*\\g\\ H i + 2 m+ i m + (1 + t)-^(V2-i/p)-| || 5 || L2( 
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and, for 1 < r < K + 1, 



(4.11) 



d l x dTsf{t)dlg 



< . J(i+r^ (1 - 1/p) -^iMiL 1( M) 

LP(R) ~ ^ 1 (1 + t)-i(l/2-l/P)-^P-i || 5 || L2( 



(4.i2) ^r^*)^ 



< . n fe-" t ||^<7|| Hi+2m+ i (R) + (l + t)-5( 1 - 1 /p)-i|| 5 || L1(R) 

L,(M) ~ ^ \e-*\\drg\\ Hl+2m+Hm + (1 + t)-i(l/2-l/ P )-l||^| U2(M) 

Proof. Bounds on s^ 1 follow directly from known bounds on s?, whereas the proofs of estimates on 

R are completely similar to the proofs of bounds on S in Proposition 3.3, with extra decay coming 
from a higher-order expansion of <j>j{£), leading to an extra £ factor in the third term of (4.6). □ 

4.2.2. Refined linear modulation bounds. 

Proposition 4.4. Under assumptions (Hl)-(HS) and (Dl)-(DS), for all t > 0, 2 < p < oo, and 
0<l + 2m<K + l, 

(4.13) \\d l x dTsf{t){h u')\\ Lvm < (i + tr^ (1 - 1/p) -^RMLi W , 

(4.14) \\%d?R(t)(h U')\\ LP( s ) < (l+tJ-^'-^-slia^Hii^n^+^+iCR). 

Proof. Again, bounds on Sj f follow directly from known bounds on s?, whereas the proof of esti- 
mates on R is completely similar to the proof of bounds on S in Proposition 4.4, with extra decay 
coming from the higher-order expansion of <j>j(£), leading to an extra £ factor in the third term of 
(4.6). ' □ 

4.3. Refined nonlinear stability estimates. With these preparations, we obtain the following 
refinement of Theorem 1.10. 

Proposition 4.5. Under the assumptions of Theorem 1.10, for all t > 0, 2 < p < oo, 

\\z(t)\\L Pm <E H2 + t) (l + t)-§ 

(4.15) 3 

\\d l x d?z{t)\\ Lv(m < Eo(l + t)", 1 < I + 2m < K - 3 

and 

\\M(t)\\ LPm <CE (l + t)-^-yp) 

(4.16) \\(M t ,M x )(t)\\ LP{R) <CE \n(2 + t)(l + t)-i 

\\d l x d^M{t)\\ Lv[m < CE (1 + t)-t, I + m > 2, I + 2m < K + 1. 

Proof. By (3.38) and (3.37), we find 

\\Af(t)\\ H K-2 {R) <CEo(l + t)-l 

Applying bounds (4.10) (i)-(4. 12) (ii) and (4.13)-(4.14)(i) of Propositions 4.3 and 4.4 to the system 
(4.8), we obtain for any 2 < p < oo, 

ft 

||*(*)||lp(r) < CE (1 + t )-|(i-i/p)-| + CE / (1 + 1 - sJ-sCVa-i/p)-^! + s )~Us 

Jo 



3 



< CE \n(2 + t) (1 + t)"*, 

verifying (4.15) (i). Estimate (4.15) (ii) is proved in the same way dropping the log term thanks to 
an extra decay of (1 + 1 — s) -1 / 2 in the integral. Bound (4.16) follows similarly as for the previous 
bounds on tp Xl ipf D 
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This verifies the estimate (1.16), thus validating the ansatz (1.6). It now remains to establish 
comparisons with solutions of (1.7). Note that, as explained in Appendix B, taking into account 
simplifications due to asymptotic equivalence of quadratic approximants and the change of variables 
(Id — ^(-,i)) _1 ) ^ is sufficient to prove comparisons with solutions to system (B.36), which we do 
below. 

4.4. Quadratic approximation. Before entering into comparisons with the Whitham equations, 
we identify now the main part of the nonlinear terms. To this end, we first need estimates revealing 
the characteristic speeds of each of the linear parts. For this purpose, we introduce for aj as in 
(3.3) 

Dj = dt + aj kd x . 

Then setting 

(4.17) Vj=pW(0), Vj=pW(0), 

we define the following total derivatives 



Note that, more compactly, we actually have 



d = d t + 



dF {( 
-duj 



\(M,k) 



I (AM) ) - eld ) kd x = d t + A*kd x 



Lemma 4.6. Assuming (H1)-(H3) and (Dl)-(DS), for all t > 0, 2 < p < oo, 



d x d?Ds*{t)g 



< 



LP( 



mm 



•(1+£)-§(1-1/p)-^-§ 

(! + *)■ 



\9\\m 



.1(1/2-1/ P )-^-I|| 5 || l 



& x a?D 8 v(tW x g) 



Lp( 



< min 



(l + tJ-i^-i/rt-^-iy^ 



when 1 < r < K + 1, 



(4.18) 



d x d™Ds p (t)(h u'] 



and, for g periodic of period 1, 



(4.19) 



d l x d?D S v{t){h Q g) <(l + t)-|a-Vp)-^||^ || L 

Lp CM) ' 



Proof. From the observation that 
(4.20) 

{tf x a?D#{t)g){x) = Y t V j [ W a(Oe A >«M' Aj ' (0 + ^ a3 ' (»0'A 3 -(O m <^(0^(0)^ 

~[ J-7T lk£ 

+ E V; r a(o e vtt)* e ^ A ^(o + (itfx^r Vj • - ^)( 0) ) (4> f (o,m))dti 

j,r=i J ~* m 

with for all j, j' 

Xj(0 + itoj = 0{i% (V(0 + itajVj • W«\0 - ^')(0)) = o(^ 2 ), 
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the proof follows the lines of previous estimates on sj. In particular, the discrepancy between decay 
rates in (4.18) and (4.19) is due to the fact that (4.18) benefits from cancellations 

(<^(0), U') = 0, for all 1 < j < n + 1. 

□ 

Remark 4.7. The fact that in (3.8) we have kept /3j(£) instead of (3j(0) precludes similar higher- 
order estimates; indeed, it may be readily checked that application of D q does not enhance decay 
by (1 +*)"'. 

The previous linear estimates may be transposed to the nonlinear level as follows. 

Corollary 4.8. Assuming (Hl)-(HS) and (Dl)-(DS), the phase ip of Theorem 1.10 and the mean 
M of Theorem 1.12 satisfy for all t > 0, 2 < p < oo, 

(4.21) \\^t) + ~kdc m - k) (M(t)Mx(t))\\L P (R) <E ln(2 + t) (1 + t)" 3 / 4 

\\# X &?D (M(t), U x {t))\\ LP(m < E (l + ty 3 /\ 0<l + 2m + 2<K-l . 

Proof. Completely similar to the proofs of (3.38) in Proposition 3.8 and (4.16) in Proposition 4.5. 

□ 

Now the next lemma pulls out the dominant part ofN(t). 
Lemma 4.9. Assuming (Hl)-(HS), (Dl)-(DS), we have 

Af(t) = d x (M(t) T f MM ■ M(t) + H x (t)fl M ■ M(t) + fUH x (t)f + n(t)) 
+ dt {M{t) T g\ 1M ■ M{t) + U x {t)gl M ■ M{t) + g p kk (k^ x (t)) 2 + r 2 (t)) , 
where fj k , g v - k are periodic of period 1 and \\rj(t)\\ L i^ < Eq ln(2 + t)(l + t)~ l , explicitly 

fliM = ~h 7} d2 f(U)(d M U, d M U) + kd M c d M U, 

fkM = ~ k d 2 f(u)(d k u, d M u) + w k c d M u + kd k u d M c + kd M u', 
fkk = ~~ k \d 2 f{u){d k u, d k u) + kd k c d k u + u' + kd k u' 

and 

9mm = 0. 9\ M = % dM U, 9 P kk = ^d k U. 

Proof. This follows by a direct, but tedious, computation, using estimates (3.38), (4.15), (4.16) and 

(4.21) , formulas from Lemma 2.3 and v = z + d k Ukip x + 8mU ■ M. □ 

4.5. Comparison with the linearized Whitham equations. Now we begin comparisons with 
the Whitham equations by proving that the dynamics described by s p (t) are well-approximated 
by the evolution of the linearized Whitham equations. To this end, we linearize about w = the 
quadratic approximant of the second-order Whitham system (1.7) and get 

(4.22) w t + kA*w x = k 2 B*w xx . 
We begin with some observations about (4.22). 

Lemma 4.10 ([48, 49]). Assuming (Hl)-(HS), the coefficients aj, bj of the expansion Aj(£) = 
—ik^aj+{ik^) 2 hj+0(^) given in (3.3) are given by the eigenvalues of the simultaneously- diagonalized 
coefficient matrices A*, B*, respectively, of the linearized quadratic approximant (4.22). 
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Proof. The first-order relation has already been established above in Proposition 1.7. The second- 
order relation essentially follows from the fact that the processes of linearization and formal expan- 
sion commute. The exact computations depend on whether we obtained the modulation system 
with the strategy followed in Section B.l.l or with the one in Section B.1.2. In the latter case, com- 
putations are rather light while in the former case they are quite tedious but completely similar to 
those in [48, 49] in the context of the Saint- Venant and Korteweg-de Vries/Kuramoto-Sivashinsky 
equations. See also the proof given in [18] in the case of the Kuramoto-Sivashinsky equation. □ 

Remark 4.11. It is worth mentioning that the diffusion matrix of (1.7) is uniquely determined only 
up to asymptotic equivalence, i.e., it is B* and not B* that is uniquely returned by the process of 
formal expansion. The relation between bj and o~(B*) was used in [18] as a means to verify the 
spectral stability assumption (D2) for the critical eigenvalues 9?Aj(£) ~ fc §R6j£ 2 . However, in the 
more recent studies [5, 6], we find it more convenient to instead verify (D2) entirely by numerical 
Evans function study, determining a,j bj at the same time by numerical Taylor expansion. 

Lemma 4.12. Assuming (Hl)-(HS), the solution operator £(i) of the linearized quadratic approx- 
imant (4.22) is given by 

ra+1 
3=1 

where the Uj{t) are solution operators of the convected heat equations 

Ufc ~\~ CljJvUx — b j k Uxx ; 

and the vectors Vj and Vj are defined in (4.17). 

Proof. This follows by a straightforward diagonalization argument. □ 

We now come to linear comparisons, i.e. we aim to verify that the dynamics described by s p (t) 
are well-approximated by the evolution of the linearized Whitham equations (4.22). To write the 
comparison results as compactly as possible, we introduce the operator I defined by 

(4-23) Tg(Z) = ±g{£). 

Note that X can be identified as the convolution with a step function, and hence takes L 1 (M) into 
L°°i 



Proposition 4.13. Assuming (Hl)-(HS) and (Dl)-(DS), letH(t) be the solution operator of (4.22) 
and g be a periodic function on [0, 1], g 6 -f^ 1 ([0, 1]). Then, for all t > 0, 2 < p < oo, 



(4.24) 



\^(t)(h u' + d) - ( d " { %Z ))9xho ) iilp( 



< 



(1 + *) p 1/p) U l ^(\\d x h \\ L i(u )nL ^ {R) + ||d||Li(R)nL°°(R)). l > 1 
(1 + t)"^ 1 - 1 /^ (\\d x h \\ L i {R)nL 2 (R) + ||d|Ui(R)nL2(R)), I = 



% S v{t)d x {h Q g) - i£(t)Xd< 



n 



(4.25) 



[id) - i^2(d i qi{0),d x g)ei]d x h 

1=1 I »LP( 



k 

\ k(q n+1 (0),d x g)d x h o 



< 



\l + t)-h( 1 -Vp)-h- l ^\\d x h \\ L i {m)nL oo {m) , l>\ 
(1 + tr^-^Wd^oh^nmm), I = 
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1 i-1 



< 



(i + ty^-yp)-^-— ||^/io|| L i(R)nL-(R), i > i 

(1 + i)"^ (1_1/?,) ll^^o||Li(M)nL2(M), I = 



Proof, (i) (Proof of (4.24) ; case ho = 0). Following the proof of (3.14) in Proposition 3.3, we 
obtain that the difference between d l x s p (t)(d) and the function 

(4.27) x ^|^^e(-^^ 

is bounded in L P (R) by (1 + tJ-^-VpM ||d|| L i (R) . Since for 1 < j < n + 1 and |£| < vr we have 

ai 



the function in (4.27) is recognized to be the low-frequency part of jT,(t)ld l x y^j , whose high- 
frequency part is still to be bounded. When I = 0, we bound it in L P (R) by Ce _?? *||d|| L i/(i/p + i/2)nR) 
using, for 1 < j < n + 1 

L 2 (R\[-7T,7r]) 

for some rj > 0. When I > 1, however, we bound the high-frequency part of |£(i)Xc^ in 



i-i 



L P (R) by Ct 2 e ^ ||^||lp(r), recognizing it as the convolution of d with a kernel that is bounded 
pointwise (using Haussdorff- Young estimates) by 



! + t 



which is bounded in L^IR) by 2 e vt , for some rj > 0. 

(%%) (Proof of (4.24), case d = 0). Following the proof of (3.23) in Proposition 3.5, we obtain 
that the difference between d l x s p (t)(hoU') and 
(4.28) 

n+l 



X I > 

J 

n+l 



£ f ^e(-^^-^^ 2 )*X(^^ (j ^O)(^(O,0,^q/ i o(e,-) -fco(0]>L»([o,i])* 

n+l /.7T _ _ -I 

+ £ / e ^ e (-^?-fc ^ )* W) ^0) (0) (^.(o, •) + e^(0, •), ^o(0>L2 ([ o,i])^ 

is bounded in D>(R) by (1 + t)-^ 1 - 1 M-i\\d x h \\ L i iR) . Now, as in the proofs of (3.24)-(3.25) in 
Proposition 3.5, we observe that, for 1 < j < n + 1 and £ G [— 7r, 7r], 

*(o), fmo - mod = -% • ( [(c7 " <^no) + * % . imo- mod) 



^•(o) + id^{% = e(^^-(o), c/') mo = v$- • 
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Up to a term that is also bounded in L P (R) by (1 + t) 2^ Vp) 2 ||<? x /io||z,i(R), the function in 
(4.28) is recognized as the low-frequency part of ^T.(t)ld l x ^ ^ kd^h^^^j' ^ e rema i mn § 

high-frequency part is bounded as in the case ho = above. This completes the proof of (4.24) in 
the case d = and by linearity the proof of (4.24) in any case. 

(Hi) (Proof of (4.25) ). Combining elements of the proofs of (3.14) in Proposition 3.3 and (3.23) 
in Proposition 3.5, we obtain that the difference between d x s p (t)d x (hog) and 



n+l 

X I ) 

3=1 
n+l 



n+l 

„-_1 



J' = l 

is bounded in LP(R) by (1 + t)-^ 1 - 1 M-i\\d x ho\\ L i (R) . Since, for 1 < j < n + 1 and f € [-vr,vr], 



(&(O,-),i£MO0> = ^ 

(4.29) 



(g)d x h Q (0 




k(q n+1 (0),d x g) 



the function in (4.28) is the low-frequency part of 

k x - 1=1 

\ k(q n+1 (0),d x g)d x h o J 

whose high-frequency part is bounded as above. 

(iv) (Proof of (4. 26) ). Following the proof of (3.23) in Proposition 3.5, we obtain that the 
difference between d l x +1 s p (t)(hog) and 



X 

n+l 

X I > 

3 



n+l _ _ -. 

£ / e^e(-^^- fc26 ^ 2 )* T -(iO' +1 /3 (j) (0)(^(0, O.MO^ao,!]^ 



is bounded in L P (R) by (1 + i) _ 5( 1_1 /p)-| H^c^io ||z,i (ir) ■ Again this function is the low-frequency 
part of the expected term, whose high-frequency part may be bounded as in the proof of (4.24) in 
the case ho = 0. □ 

Note that, since A* and £(t) commute, combining (4.26) with (4.19) we obtain for any periodic 
function g the bound 

\\d l x d tS v(t)(h g) + mAX ( (5 ^ ) 

< ((l + tr^M-ht-^wd^ohi^L^iR), i>h 

~ \(1 + t)-3( 1 -Vrt|| 5x / l0 || i , 1(R)ni2(R)j | = . 
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4.6. Nonlinear connection to the Whitham equations. Before proving that the local means, 
wavenumber, and phase are indeed well-approximated by solutions of the appropriate Whitham 
equations, it is necessary to compute some averages involving the main part of the nonlinear term 
N identified in Lemma 4.9. This is the purpose of the next lemma. 

Lemma 4.14. Assuming (Hl)-(HS) and (Dl)-(DS), we have the identities 



1 f if MM) - i E^ 1 ( () )' 9 ^MM) e ! | = f-2 9 M F \(M,k) + d MC\ (h; .., , 

k{q n+1 (0),d x f MM ) 



\ 

^°M LU \(M,k) 



i J (4 p m) -*£<%~Ko),^/ fc p M > e/ j _ K ^ 

k(q n+1 (0),d x f% M ) 



p 



-dkM F \(M,k) + dfc c |(M,fc) Id ~ \ (^M-P|(M,fc) ~ cId 
dfcM w |(M,fc) 



»)• 



n 



I I </&> -<E<W)'<W&>« I _ A f(9lk)\ = (-\ d l\M,k) - \^ F \(M,k) 



k I j.J- 1 . . p . | * V J V 2 d k u \{M,k) 

Proof. Simple means are computed directly, for example 



J k V kd M c l{M ^ j 



To compute the remaining terms, we first use the identity Id = 5^j=i ^j'^j and come back to 
(4.29). Using the algebraic identities (A. 3) (in), (A. 4), (A. 5) and (A. 6), then, we find that 

^i/m = p' ^M UJ \(M,k) ~ 2 L °^ mU \( m ^)^ 

®xfkM = ~U' kdl M c\( My k) - L Q d kM U\(M,k): 

®xfkk = p' ®k u \{M,k) ~ 2 L 0^M U \(M,k) - T L o9kU\( Mk \y 

Now, observe that, for 1 < j < n + 1, expanding the fact that for, |£| < £oj fijiO is a left 
eigenfunction of associated to Aj(£) yields for any periodic g the identity 

z(d^(0),L g) = k(^(0),L^g) + a j k(4> j {0),g) = kVj ■ (^q^) + kVj ■ A, ^ . 
Since, for 1 < j < n + 1, 

i{d^j(0),U') = -k, 

the proof of the lemma is then achieved by simple direct computations. □ 

We have now in hand all the pieces needed to achieve the proof of Theorem 1.12. 

Proof of Theorem 1.12. Estimates (1.13) and (1.19) already follow from Proposition 4.5 together 
with Lemma 2.7. 

Using Duhamel's principle we may write (B.36) as 

(t) w - m ( 4 tT) + 1 - •».(§ (t ) T r - (?;) ) <•*■ • 
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where m(ho) := —(U — M)d x ho and £ is the constant-coefficient solution operator defined in 
Lemma 4.12. On the other hand, using Propositions 3.3, 3.5 and 4.13, and Lemmas 4.9, 4.12 
and 4.14, we find that 

where the residual r p (i) satisfies the bound 

IP(0IIlp ( r) < E ln(2 + t) (1 + tJ-la-VpH, 2 < p < oo, 

where £b is defined as in Theorem 1.10. Thus, subtracting and defining 5 := (M, kip x ) — (Mw, kw), 
we have 




Letting r] > be fixed but arbitrary and defining 

u{t):= sup sup imh^l + s)^ 1 - 1 /^-*, 

pG[2,oo] 0<s<t 

we thus obtain by the standard bounds (see [41]) W^(t)d x f\\ LP ^ < (7t~5( 1 /3 _1 /p)~l when 
1 < q < p < oo, that, for any 2 < p < oo and all t > 0, 

PWIIlp ( R ) < ln(2 + t) (1 + t)-3 (1 - 1/p) "^ + / (* - S )-^ (1 - 1/p) -^(t)i5;o(l + s)- 1+ ^ds 

Jo 

+ / {t - s)-^ 1/2 - 1/p) -^v(t)E (l + s)-* +ri ds 

Jt/2 

<EoM*) + l)(l + r^ 1 " 1/p)4+ '' I 

which in turn yields u(t) < C V E (1 + v{t)) . Thus, if E < 1/(2C V ) it follows that v{t) < 2C V E 
for all t > 0. This provides the needed bounds on (M — Myy,kip x — kw)- Using these bounds a 
simpler computation yields the result for V>w — D 

Remark 4.15. Though the computation of quadratic coupling coefficients is heavy going, we note 
that already from Proposition 4.13 and Lemma 4.12 one may conclude that long-time behavior 
is governed to leading order by some " Whitham-like" system (B.27), with no computation at 
all, since in the Duhamel formulation the principal part of nonlinear terms factors on the left as 
£v(t-s)d x (...)(s)ds. 
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Appendix A. Algebraic relations 



We record in this appendix some crucial relations obtained by differentiating the profile equations. 
In order to differentiate, we here consider variable parameters (M, k) rather than fixed values (M, k), 
imposing dependence implicitly through the profile equations (denoting (a) '■= J a)'- 

(A.l) k 2 U" - k{f(U))' + kcU' = 0, (17) = M. 

We expand L ? = L + ifc£L (1) + {ik£) 2 L^ with 

L v = k 2 v" - k((df)(U) v)' + kcv' ; 
(A.2) L^v = 2kv'- (df)(U)v + cv ; 

L^v = v. 

Then, by differentiation of (A.l), we obtain 

L U' = 0, ([/') = 0; 

(A.3) L d M U + kU'd M c = , (d M U) = Id; 

L d k U + W d k c + L^U' = , (d k U) = 0. 

Accordingly, with lo = —kc, using L^U' = W", we have 

< d 2 U >= , 

(A.4) L d 2 M U + 2k (d M c) (d M U)' - k [{d 2 f){U){d M U, d M U)]' = -U'kd 2 M c, 

L d 2 M u + k(d k c){d M uy + k{d M c){d k uy -k[(d 2 f){u){d k u,d M u)]' 
+ k(d M uy> = -u'kd 2 kM c , 

(A.6) L d\U + 2fe (5 fcC ) (d k U)' - k [(d 2 f)(U)(d k U, d k U)]' + 2k (d k uy = -u' k d 2 k c . 

Appendix B. The Whitham equations and asymptotic equivalence 

In this appendix, we explain how to obtain the needed formal averaged modulation system for 
comparison to our analytical description of asymptotic behavior. This is performed in three steps. 

(1) First, we develop a direct WKB-like formal approximation. At this stage we obtain a system 
that may contain harmless irrelevant terms. 

(2) Next, we use known results about large-time asymptotic behavior of systems of conservation 
laws about constant states to get a canonical form for the averaged modulation system. 

(3) Finally, we adapt the system taking into account the fact that the analysis of the main part 
of the paper is carried out after an implicit nonlinear change of coordinates. 

B.l. Formal asymptotics. Though the full nonlinear analysis may be carried out without dis- 
tinction between linearly coupled and linearly uncoupled cases, the formal derivation of averaged 
equations involves resolutions of systems of the form L$g = h and therefore requires knowledge of 
the kernel of L . We are thus compelled to provide two separate derivations. 

Besides, there are at least two ways to obtain relevant averaged equations. The first one is to 
develop a full WKB-type expansion as in [48, 49], extending the procedure in [62] to get higher 
order equations. This method provides the hyperbolic part of the averaged system in a quick way 
and a nice form. Its main drawback is that it requires a knowledge of the kernel of Lo for all waves 
close to the wave under study essentially reducing the scope of the method to the nondegenerate 
case or to a fully degenerate case where Omc would vanish in a neighborhood of the studied wave. 27 



27 A 

situation that trivially occurs when some symmetry is present, see Remark 1.27. 

44 



The second method is designed to study dynamics about a given wave, so that it does not suffer 
from the same flaws; moreover, it is closer to our nonlinear analysis, and yields a semilinear system. 

We derive the system for the generic case with the first method and the one for the linearly 
uncoupled case with the second one. Note that both methods provide averaged systems with 
diffusion matrices containing terms that are not relevant for our present analysis. 

B.l.l. Generic case. To treat the linearly phase-coupled case, we essentially borrow the derivation 
of [49] for the Korteweg-de Vries/Kuramoto-Sivashinsky equation, a model for which linear phase- 
coupling is a consequence of assumptions (H1)-(H2) and (D3). In the present derivation, we assume 
that all the waves involved in the slow-modulation description satisfy (H1)-(H2) and (D3) and are 
linearly phase-coupled. 

Since in this derivation there is no reference wave, thus no privileged frame, we go back to the 
original equation 

(B.l) ut + f(u) x = u xx . 

We are looking for a formal expansion of a solution u of equation (B.l) according to the two-scale 
ansatz 

(B.2) u(x,t) = u( ^ £X,£t ^ ;ex,et 



where 

(B.3) U(y,X,T) = J2e j Uj(y;X,T) and \Jr (X, T) = £ £ %(X, T) , 

3 3 

with the functions U and Uj being 1-periodic in the y-variable. We insert the ansatz (B.2, B.3) into 
(B.l) and collect terms of the same order in e. 

First this yields, with Qq = dx4>o and kq = dx^o, ^odyUo + Kod y (f(Uo)) = K^dyUo, which is 
solved by 

n (X,T) = -k (X,T)c(M (X,T),K (X,T)), 

(B.4) 

U (y;X,T) = U(y; M (X, T), kq(X, T)). 

We have disregarded in (B.4) the possibility of a phase shift dependent on (X, T) since this is 
already encoded by ^i. We will have to rule out similar problems of uniqueness in the following 
steps. At this stage the compatibility condition drdx^o = dxdr^o already yields the first equation 
of a Whitham's modulation system: 

(B.5) d T Ko + dx(K c(Mo,Ko)) = 0. 

In the rest of the derivation, we will use the notations of Proposition 3.1 and Appendix A, with the 
convention that operators act in y and are associated to the wave profile U( • ; Aio(X, T), kq(X, T)). 
To fix some of the uniqueness issues of the ansatz, we pick, for any (M,k), u ad i ( • ; M , k) a gen- 
eralized zero eig enfunction of such that (u adj , d M U m ,k)} = and (u adj , [/'(•; M, k)) = 1, set 
UQ dj ' (y; X, T) = u adj '( ■ ; Mo(X; T), Ko(X, T)) and add to the ansatz the normalizing condition 

(B.6) (u a dj ,U j (-;X,T)) = 0, j + 0. 

The next step of the identification process gives, with = dr^\ and n\ = dx^i, 

(n 1 +c(M ,^)n 1 )d y Uo - KiL^dyUo - L U! - L^dxUo 

(B.7) 

- dxK LWd y Uo + d T U + c(Mo,K )d x Uo = 0, 
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whose solvability condition reads 

(B.8) &rMo + d x (F(M ,Ko)) = 0, 

where F denotes the averaged flux F(M, k) = {f(U( • ; M, k))). To proceed, for arbitrary (M, k) 
we introduce g k (- ;M,k), g M ( ■ ;M,k) solutions of 

(B.9) L (g k (-;M,k)) = - L^d k U m>k) - d k F m>k) - L^U[ {M>k) 

- 9kU\(M,k) &<9fcC|(M,fc) - {dMU\(M,k) - Id) <9fc-F|(M,fc) 

(B.10) L (g M ( • ; M, k)) = -L^d M U\ {M ,k) ~ d M F\ {M ,k) + c\ {M ,k)U 

-9kU\(M,k) kdMC\(M,k) - (dMU\( M ,k) - Id)[<9M-F|(M,fc) ~~ c |(M,fc)Id] 

orthogonal to u ad i(-;M,k) and set g = (g M g k ) and g (-;X,T) = g( ■ ; Mq{X, T), k (X, T)). 
Then with (B.5)-(B.8) and (A. 3) equation (B.7) reads 

(B.H) Lo(tfi - dU mo , Ko) (--Mi,Ki)-9o (^°)) 

= (ill + «orfc| (iMo)Ko ) + C| ( . Mo)Ko )/ci)5j / ?7o 

for any choice of .Mi. Let us set Mi = (Ui). Choosing Mi to get 
(B.12) Qi + kq dc(Mo,K )[Mi,K 1 ] + c(Mo,K Q )K 1 = 

and normalizing the parametrization, as in Lemma 4.1, to get, for any (M,k), 
(B.13) (« a *(-;M,fc),O fe C/| (M)fc) )=0, 
equation (B.ll) is reduced to 

Ui = dU l{Mo!Ko) {Mi,Ki) + g 0£^ o °) 

+ < 9 .) (tr °) ■ 

Then, compatibility condition <9tki = <9x^i yields 

a T Kl + 5x(ko^C| (A1oiKo ) [Ml,Ki)+C\ {Ma)Ka) Kl) = 



3x ^ 9mC| (A 4 0iKo ) (g ) y dxKo 

Returning to the identification process, we obtain an equation of the form 

drUi + d x {df{U )Ui) - 8 2 X U - L U 2 + 8 y { ■ ■ ■ ) = 0, 
whose solvability condition is 

d T Mi + d x (dF l{MfhKo) [Mi,Ki]) = d x M - dx((df(U )g ) ( d ^°)) 
(B.15) , V / rl K M\\ 

+ dx(d M F l{Mo , Ko) (g ) (%™°)). 
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To write the second order system in a compact form, let us introduce, for arbitrary (M, k), 
d hl (M,k) = U-(df(U(M,k))g M (M,k)) + d M F(M,k)(g M (M,k)) 
d 1:2 (M,k) = -(df(U(M,k))g k (M,k)} + d M F(M,k)(g k (M,k)} 
d 2il (M,k) = kd M c(M,k) (g M (M,k)) 
d 2 , 2 (M,k) = kd M c(M,k) (g k (M,k)). 

With these notations, systems (B.5,B.8), (B.14,B.15) coincide with the first systems obtained in 
the formal expansion of a solution (M , k) of 



(B.16) 



f d t M + d x (F(M,K)) = d x (d 1 , 1 (M,K)d x M + d 1 , 2 (M,K)d x K) 
\ d t K + d x (Kc(M,K)) = d x (d 2 ,i(M,K)d x M + d2,2(M,K)d x K) 

according to the slow ansatz 

(B.17) (M,K)(x,t) = ^e i (7W i ,K j )(ex,et). 

3 

We call system (B.16) a (second-order) Whitham's modulation system. 

B.1.2. Phase- decoupled case. For the phase-decoupled case, we propose an alternative derivation 
that would also work for the uncoupled case. We pick a wave of parameters (M, k) and assume 
that it satisfies (H1)-(H2) and (D3) and is linearly phase-decoupled. 

We again insert the ansatz (B.2,B.3) into (B.l) and collect terms of the same order in e but this 
time we specialize to (Mq,kq) = (M,k). We keep (B.6) as ansatz normalization and (B.13) as 
parametrization normalization. The first nontrivial equation is with = (dr^i,dx^i) 

(B.18) (fii +c{M,k)K l )U' - ki L^U' - L Ui = 

which may also be written as 

L (lli - dU m ,k) ( • 5 Mi, ki)) = + kd k c(M, k) m + c(M, k) Kl ) U' 
for any Mi. Solvability yields 

fii + kd k c(M, k) ki + c(M, k) m = 
and with our normalization choices (B.18) reduces to 

Ui = dU^M,k)(Mi,Ki), Mi = (Ui). 
Compatibility condition Ot^i = dx^i already gives 
(B.19) d T ni + dxikdkC^ ^Ki + c(M,k) ki) = 0. 

At the next step of the identification, we get with (Q 2 , k 2 ) = (8t^ 2 , dx^ 2 ) 
(n 2 + c(M,k)K 2 ) U' - k 2 L^U' - L U 2 - L^dxUi 

- (KifL^U" - kiL^U[ - dxKiLm* 

+ (ni + c (M,k)Ki)u[ + d T Ui + c(M,k)d x Ui 



+ kd y (^d 2 f{u){Ui,Ui)^j + Kid y (df(u)) Ui = o, 
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which may also be written 

(B.20) L (u 2 - dU m - k) (-;M2,K 2 )-^d 2 U m - k) (-;(M 1 ,K 1 ),(M 1 ,K 1 ))') 
= d T U 1 + c{M,k)d x U 1 - dxniL^U'- L^d x Ut 
+ (n 2 - d k uj(M, k) k 2 -^d 2 u;(M,k)((Mi,K 1 ),{Mi,K 1 ))^ U' 

for any M\. Solvability then reads 
(B.21)drMi + d x {dF m ,k)(Mi,Ki)) = 0, 

(B.22) n 2 - d k u{M~k)K 2 - ^d 2 uj(M,k)((M 1 ,K 1 ),(M 1 ,K 1 )) 

= d X Ki + (u^iM^L^dkU^M^xK! + (u adj (M,k),L^d M U m ,k))dxM, 
Note that the latter equation yields 

(B.23) d T K 2 -d x (d k oj(M,k) K2 + ^(^.^((Mi.kO^Mi.ki)) 

= 8&Ki + d x ((u ad i(M,k),L^d k U m - k) )d X Ki + {u adj {M~k),L^d M U m ,k))dxM^ . 
To proceed, we introduce g k , g M , the solutions of 

+ f/'(^(M,I),l(%C/ |W )-3^ |( ^ fcd fcW) , 

L 9 M = -L^QmU^m,^) ~ d MU\ { M^){dMF\ { M^ - c^^Id) 

+ U'{u ad i(M, k),L^dMU^,k))^ 

mean free and orthogonal to n adj ( • ; M, k) and set 5 = (g M g k ). With (B.21) and (B.22), setting 
M. 2 = (C/2), equation (B.20) becomes 

U2 = dU m - k) (M 2 ,K 2 ) + ^| ( m|)(-;(Mi,«i),(Mi,«i)) + fif (a^ 1 )- 

Finally, substituting (B.2,B.3) into (B.l), and comparing terms of order e 3 , we obtain an equation 
of the form 

d T U 2 + 8 X ^/(C7)C/ 2 + ^ 2 /(^)(^i,^i)) - & x Ui ~ L U 3 + d y {---) = 0, 
whose solvability implies 

(B.24) IhMi + d x (<i-fi(M,i)(jH 2 ,K2) + i<i 2 Ji ( M,i)[(^(i>'«i),(A(i,K 1 )]J 

- - * (<#<ff)0 

To write the second order system in a compact form, let us introduce, for arbitrary (M, k), 
di,i = Id-(#([/(M,fc))5 M >, di )2 = -(df(U(M,k))g k ) 

d 2A = (u ad ^M,k),L^d M U m - k) ), d 2 , 2 = l + (u ad i(M,k),L^d k U m - k) ) ' 
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(B.25) 



With these notations, systems (B.19,B.21), (B.23,B.24) coincide with the first nontrivial systems 
obtained in the formal expansion of a solution (M, k) of 

d t M + d x (F(M,K)) = d x (d 1>1 d x M + d 1 , 2 d x K^ 

d t K + d x (Kc(M,K)) = d x (d 2 ,i d x M + d 2 ,2 d x Kj 

according to the slow ansatz 

(M,K)(x,t) = ^2e j (Mj,Kj)(ex,et), (M ,k ) = (M,k). 

3 

We call system (B.25), likewise, a (second-order) Whitham's modulation system. 

As should be clear from the formal derivations, there is some freedom in the choice of the diffusion 
matrices. This reflects the fact that many systems of conservation laws share the same asymptotic 
behavior about constant states. We recall next how to classify these systems according to their 
asymptotic behavior; this will provide a canonical modulation system for our nonlinear analysis. 

B.2. Asymptotic equivalence of systems of conservation laws. 

B.2.1. General theory. We now recall the notion of asymptotic equivalence and behavior of solutions 
of systems of conservation laws near a constant state, useful in our context since, being able to prove 
modulational behavior, we reduce the dynamics about a periodic wave to motion of parameters 
near a constant state. Given a general system of conservation laws 

(B.26) w t + {g{w)) x = {B{w)w x ) x 

and a reference state w* at which dg{w*) has distinct eigenvalues, so that L*dg(w*)R* is diagonal 
for some L* = : I . R* = (r* ... r*), L*-R* = Id, define the quadratic approximant 

w 

(B.27) y t + A*y x + ^(y*^?/)^ = B*y xx , 

and the decoupled quadratic approximant 
(B.28) 
about w*, where 

(B.29) A* = dg (w*), := d 2 g(w*), and B* := B(w*), 

(B.30) f * := L*diag{i?*r*.R*}L*, and B* := R^diagiL^B^R^L,. 28 

Assume the parabolicity condition, diag{L* B(w*)R*} is positive, and define the self-similar 
nonlinear (resp. linear if jj = 0) diffusion waves 9j(x,t) = t~ l l 2 6j(x/ \fi) to be the solutions of the 
Burgers equations (resp. heat equations if 7j = 0) 

(B.31) t + \(n*e 2 ) x = 9 XX , 7* == [/*(r*)T,r*]/[/*^r*], 

with delta-function initial data l*m$ 5(-), where ttiq := J zo(x)dx. Then, we have the following 
fundamental result describing behavior of (B.26)-(B.28) with respect to localized initial perturba- 
tions. 



(B.28) z t + A*z x + -(Az) x = B*z x 



2 ^Here and elsewhere, we identify as usual bilinear maps with vector- valued matrices, and in particular d 2 g(w,) 
with Hess(g)(w*). Moreover, for these vector- valued matrices, we use coordinate notations so that for instance 
Ti £ R nxn satisfies w^iw = (ti)T,B))j = d 2 gj (w,w). 
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Proposition B.l ( [35, 41] ). Let i] > 0. Let w and z be solutions of (B.26) and (B.28) with 
initial data wo and zq = wq — u>* such that E\ := ||^o||z, 1 (R)n^ 4 (E) + II I ' I^oIIl^m) is sufficiently small. 
Then, for 1 < p < oo, ijiq := J r zq, and 6j as in (B.31), 

(B.32) \\w(t) -w.- z(t)\\ LP{m <E 1 (1 + t )-^-yp)-k+v. 

and 

(B.33) \\z(t) - J>(- ~ a*(l + t),b*(l+t)) r*|| LP(R) < ^(1 + t)-§(i-VpH+»», 

j 

with a* := ljA*r*, b* := ljB*r*, whence 29 , ifrj< 1/4, 

IH^-^IIlpCR). lk(*)llw(R) £ Kl(i + ^ (1 ~ 1/p) • 

Proposition B.l asserts that (B.26) and (B.28) (hence also (B.27)) are asymptotically equivalent 
with respect to small localized initial data w $ — w* = zq € L 1 (R, (1 + |x|)dx) n H 3 (R), in the 
sense that the difference between solutions z(t) and w(t) — w* decays at rate (1 + t)~z ( 1 ~ 1 /p)~j+ r i 
approximately (1 + i) _ * faster than the (Gaussian) rate \mo\ (1 + f.) - ^ 1-1 /^) at which either one 
typically (i.e., for data with small L 1 first moment) decays. Moreover, through (B.33), it gives a 
simple description of asymptotic behavior as the linear superposition of scalar diffusion waves 8j 
moving with characteristic speeds (eigenvalues a*) in the characteristic modes (eigendirections r*) 
of dg(w*), satisfying Burgers equations (B.31). 

We have also the following more elementary result comparing to the full quadratic approximant. 

Proposition B.2 ([26] 30 ). Let r\ > 0. Let w and y be solutions of (B.26) and (B.27) with initial 
data wo and yo = wo — w* such that E$ := ||yo||z, 1 (R)ni? 4 (i[i) ^ sufficiently small. Then, 
for 1 < p < oo, 

\\(w - - y){t)\\ Lnm < Eo(l + t )-U±-Vp)-h+v. 

An important consequence of Proposition B.2 is that only the quadratic order quantities appear- 
ing in (B.27) need be taken into account in the study of asymptotic behavior of (B.26) to the order 
of approximation considered in Theorem 1.12. Finally, we note the following result following from 
a proof similar to but much simpler than the one for Proposition B.2 given in [26, Appendix A]. 

Lemma B.3. Let k satisfy k(0) = and 

(B.34) kt + ak x + (~fk 2 ) x - dk xx = (Fk) x , 

where a,j,d are constant, d > and F is a given function such that \\F(t) \\l 2 (m) < Fq(1 + t)~*. 
Then, for any rj > 0, provided Eq is small enough, for 1 < p < oo, 

\\k(t)\\L Pm ^E (l + t)-h(^M-h+v_ 
For the sake of completeness, we recall the proof of the previous Proposition in Appendix C. 



29 Computing ||0j(t)|kp(E) =t- 1 ^\\e j (-/Vt)\\Lv ( m ) ~ f^ 1 " 1 /"). 

30 Though stated in [26, Lemma 1.2] for scalar equations, the proof applies equally to the system case; see Appen- 
dix C. 
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B.2.2. A first application. As an immediate application, we may now establish the improved decay 
bounds (1.22)-(1.21) of Corollary 1.18. We will use these tools again in establishing (1.18). 

Proof of Corollary 1.18. Bound (1.21) follows from the assumption kd x ho = 0. For, a solution 
(Mw,kw), with an initial data (*,0), of the decoupled approximating equations (B.28) to (1.7) 
satisfies kw(t) = 0, since the k equation decouples in (B.28) for the linearly phase-decoupled case. 
Comparing to the actual solution of (1.7) using (B.32), we obtain the result. Bound (1.22) goes 
similarly, observing that in the quadratically decoupled case, the k equation in the full quadratic 
approximating system (B.27) to (1.7), though it does not completely decouple, is of the form (B.34) 
with F = 0(M). □ 

Remark B.4. Analogous to (B.34) in the quadratically decoupled case, the rate-determining bound 
in the linearly decoupled case of Proposition B.l is the key estimate 

\\k(t)\\L P <E (l + t)-^M-l 

established by Liu [40] for quadratic coupling terms involving different modes, thus obeying fc(0) = 
0, kt + ak x + ( / yk 2 ) x — dk xx = (6 2 ) x , where 9(x,t) = 6{x — at,bt) with a ^ a and 6 a self similar 
solution of a Burgers equation (B.31). The anomalous rate (1 + i)3 is different from the powers 
of (1 + i)2 arising in scalar convection-diffusion processes, reflecting the additional complications 
present in the system case. 

B.2.3. Quadratic approximants of modulation systems. For later reference, let us write, in the 
original frame (and not the co- moving one), as 

(B.35) dt^J + ^\^ m . ){MM ) + 2 dx {d 2 u; Km ((M,k),(M,k))) ~ B * d °\k 

the quadratic approximant of (1.7) (obtained as (B.16) and (B.25) above). As pointed out in 
Remark 4.11, it follows from Lemma 4.10 that this system is independent of the choices made in 
the course of the formal derivation. 

From the general theory, we know that instead of comparing (M, n) in Theorem 1.12 to a solution 
(A4w, k-w) of (1.7), we only need to compare it with (M, k) + {Myy, kw) with (Myy, kw) a solution 
of (B.35) expressed in the co-moving frame. 

B.3. Implicit change of variables. Our nonlinear analysis begins with an implicit nonlinear 
change of variable (2.5). We explain now how the modulation system is affected by this change 
of variables. We could have first performed this implicit change of variables then carried out the 
formal modulation process, but we find more enlightening to change the system a posteriori. 

Since our diffeomorphism is close to identity, only nonlinear terms should be changed, and from 
the asymptotic equivalence theory we know that nonlinear terms are relevant only in the hyperbolic 
part. Therefore it is enough to investigate how (B.5,B.8) is altered. Let us introduce $o such 
that $ (y (X,T),T) = X. Recall that d T ^o = u}(M ,d x ^o)- Therefore if A, B are such that 
d T A + d x B = then (A, B)(X, T) = (A, B)($ (X, T),T) implies 

_ |r$o + 1_ dxf} = Q 

dx<S>o dx<S>o 

also written = or 

d T (d x <5>oA) + d x (b - c (m , — y Aj = 

with A4o(A, T) = Mo($o(X, T),T). Note that this kind of manipulation is completely similar to 
the ones needed to perform usual Lagrangian change of coordinates and of course closely related 
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to the computations involved in the proof of Lemma 2.3. As expected, applying this to (B.5) leads 
to a trivial equation while an application on the trivial equation <9t(1) + d x (0) = gives 

dr(d x *o) - ^( c (^o,^)) = 0. 
Equation (B.5) is changed into 

dr (d x * Mo) + d x (f (Mo, - c (m , M 

At the hyperbolic level, we are thus lead to the system 

dTP ~ Ox (*{¥>$)) = 
9tM + dx (F(f,i)-c(%,l)M) = 

whose quadratic expansion in 

/ . ^ ( l &\ f-kl -kM M 

gives 

d T k-~kc(M,~k)d x k - ^x(^| ( ^ ) (M,A;) + ^ 2 W | ( ^ ) ((M,A ; ),(M,A;))^ 
d T M-kc(M,k)d x M + d x (dF m ^(M,k) + ±d 2 F m - k) ((M,k),(M,k))) 

+ kd x (dF^Mfy(M,k) — c(M,k)M^ — dc^^(M,k)M^j =0. 

Two main comments are in order: 1. We end up naturally with equations expressed in a co- moving 
frame thus no further change is needed. 2. The wavenumber equation remains unaltered at this 
level of description. This explains why the fact that the implicit change of variables could change 
the modulation equations was not revealed by previous studies [26, 57] focusing on situations where 
no other wave parameter is involved, 

Remark B.5. Though we do not need it for the present semilinear analysis, let us describe for the 
sake of generality what would happen for a full quasilinear parabolic system. For 

with 

dt* + Vi(M,*x) = D n+1 {M,y x )d x (^\ 
where A n+ \ = e n+ \ ■ A, D n+ \ = e n+ i • D, the transformation <I> = 'I' -1 , M = M o $ leads to 
fy* - $ x A n+1 (M, l/$ x ) = -D n+1 (M,l/$ x )d x 

with 

d t M + d x (A ± (MA/<l> x )-d x <f>A n+1 (M,l/$ x )M) 

= d x (^D ± (M,l/* x )d x (^Q - D n+1 (M,l/$ x )d x (^J Mj 

d t {d x $) - d x ($ x A n+1 (M,l/$ x )) = -d x (D n+1 (M,l/$ x )d x 
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o \ 



where A i = 



ld d 



xd 



A, D ± = 



ld d 



xd 



D. 



Collecting the results of this appendix, we find that to validate the formal Whitham modula- 
tion approximation, we only need to compare the couple (M,ktp x ) of Theorem 1.12 to a solution 

(Mw, kw) of 



M 
k 



ar» i+ kAd i M ) + ho (*fwfi)W,k),{M,k))\ _ - 2 ~ 2 (m 

°' [ ' 1 1 '• ' T {k ) + 2 kdx \(Pu m - k) ((M,k),(M,k)) 1 " b:<> 
■k 



+ ~kd x (l {dF\ { M; k) {M,k)-c{M~k)M) - dc m ; k) {M,k)M^ = ^ 
where A* = d(M,k)(F ~ ^M, —oj — ck)\^^y For writing convenience, we denote this system by 



(B.:!(>) i), ( ^ j + kA*d x (™ 



,2/ M 

x \ k 



Likewise, tp in Theorem 1.12 needs then to be compared with tpw a solution of 



Appendix C. Asymptotic equivalence of quadratic approximants 

For completeness, we include here a proof of Proposition B.2 including the treatment of off- 
diagonal diffusion terms not arising in the scalar case considered in [25]. 

Proof of Proposition B.2. (Case B* = B*.) We first review the case B* = B* treated in [25]. By 
the general results of [35], provided Eq := ||yo||L 1 niT 3 (iR) i s sufficiently small, we have for 1 < p < oo 

\\w(t) - wJ\ LPm , \\y(t)\\ LP{R) < E (l + ty* (1 -*\ 

3 

\\Wx(t)\\ H i(R), \\Vz(t)\\ H i(K) ^ E (l + t)~*. 

Setting 5 := iv* + y — w, we have, subtracting and rearranging, 

S t + AJ X - BJ XX = 8 X F, T = 0{{\w -wA + \y\)8) + 0(\w - w*\ 3 ) + 0{\w - w*|K|), 
with 5\t=o = and A* and B* as in (B.29)-(B.30). By Duhamel's formula, 



S(t) 



/ a(t — s)d x J 7 (s)ds, 
Jo 



where a is the solution operator of the parabolic system of conservation laws ut + A*u x — B*u xx = 0. 

Applying the standard bounds [41] \\a(t)d x h\\ LV {M.) < t 2( i p> 2 ||/i||l<7(r), 1 < q < p < oo, together 
with 



i(i-i/9)-i 



H\\m\\LHR) + \\Mt)\\LHR)) + E^(l + 
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1 < q < 2, we find, denning v(t) := sup < s < t ||5(a)||L2(R)(l+s)s (1 ~ 1/p)+ s~'', that, for all 1 < p < oo, 

ft/2 

II^)IIl, ( r) < / (t-sy-^M-zWT^h^ds 

Jo 

+ j\t - s) -|(V(min(2, ?) ))-l/ P )-I || J- (s) || Lmin(2 , p)(R)ds 
/•t/2 

< / (t- S )-^ (1 - 1/P ^2( l ,(t) J E + i? 2 )(l + S )- 1 +^ S 
JO 

+ S )-^ 1 /(min(2, P ))-l/p)-|^ (t)Eo + ^g )(1 + sr l + ,-I(l-l/(min(2,p))) ds 

Jt/2 

whence < C^o (#0 + This implies that z/(i) < 2(7^ for E < 1/(2C V ), giving 

\\s(t)\\Lnm < 2C v E 2 (i + ty^ 1 - 1 M-^, i< P <^. 



(General case.) We treat now the general case that B(w) = B* + 0{w — w*) with B* constant 
but not equal to 5*. Defining again 5 := w* + y — w, and denoting by a(t) the solution operator 
of linear system ut + A*u x — B*u xx = 0, we have by Duhamel's principle 

S(t) = (a - a)(t)w + / *(or - - s)a x O(|u; - w Sf \ 2 )(s)ds 
+ £ or(t - s)d x O(\6\(\w -w.\ + \y\))(s)ds 
+ Jq a(t — s)d x O(\w — w*\ 3 + \w — w*\\w x \)(s)ds. 
From [35], provided Eq := ||2/o || J L 1 n J H" 4 (M) * s sufficiently small, we have for 1 < p < oo 

IK*) - w4 LP(R) , \\y(t)\\ L p {R) < E (l + t)~^^\ 
\\w x (t)\\ H 2 {R) , \\y x (t)\\ H 2 {R) <E (l + t)~L 

Applying the bounds [35,41] \\(a-a)(t)d x h\\ LP{R} < f ^4)~5 (!+<)-§ \\h\\ Lm + e~ et ' = " 



1 < <7 < p < oo for r = 0, 1 (and some 9 > 0), estimating 

II / (cr - cr)(t - s)d x O(\w - w*\ 2 )(s)ds\\ LP(R) 
Jo 

ft/2 ft 

< / (t-s)-^ 1 - 1 M- 1 \\\w-w*\ 2 ( S )\\ Llm d S + / e- e ^\\\w-w*\ 2 (s)\\ LPm ds 
Jo Jo 

+ / (t - s)~2(l + t - s )~2 \\\ w - w*| 2 (s)|| LP ( R )ds 

Jt/2 

<£() / (t-s)"^ (1 " 1/p) " 1 (l + s)"5ds + J E / e -*( t -')(l + s)-3( 1 - 1 /P)-3d s 

Jo Jo 

ft 

+ E / (t - a)"! (1 + 1 - (1 + s)-^ 1 - 1 ^)-^ 

Jt/2 

<^ ) (l + i)-5 (1 - 1/l ' ) -5log(2 + t), 
and other terms either similarly or similarly as in the previous case, we obtain the result. □ 
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Appendix D. Generalizations 



We conclude in this appendix by describing briefly extensions to more general types of equations 
arising in applications, and the modifications in our arguments that are needed to accomplish this, 
discussing also, when possible, the verification of (H1)-(H3) and (D1)-(D3) in specific cases. 

D.l. Extensions in type: quasilinear and partially parabolic systems. Our analysis carries 
over in straightforward fashion to divergence-form systems of general quasilinear 2r-parabolic type. 
For example, the spectral preparation results of Lemma 1.5, Proposition 1.7, and Proposition 3.1 
all go through essentially as written, depending on no special structure other than divergence form. 
From these low- frequency /Bloch number descriptions, we obtain the same linear bounds on the 
critical modes s p as described here in the 2-parabolic semilinear case. The high-frequency and 
or high Bloch number analysis also go through unchanged, the former depending again only on 
the spectral preparation results and the latter depending only (through Priiss' Theorem) on high- 
frequency resolvent bounds following from (but not requiring) sectoriality of the linearized operator 
L about the wave. This completes the linear analysis. 

Likewise, by Remark 2.4, we obtain the useful representation (2.6) of the nonlinear perturbation 
equations stated in Lemma 2.3, with sources Q, 1Z, S of quadratic order in v, ip x , ipt, and a finite 
number of their derivatives, which was all that was needed for our nonlinear arguments. To obtain 
the nonlinear damping estimate of Proposition 2.5, we note that (2.14) becomes 

(1 - ip x )v t + (-l) r k 2r d x (B(U, dl r " 2 U)dl r - l v) = lower order terms, 

(—lVd^v 

U = U + v. Thus, taking the L 2 (R) inner product against Ylj=o — i T — > integrating by parts, 

and rearranging, we obtain jji\\v\\ 2 H K ( R }(t) < — 6\\dx +rv (t)\\ 2 L?(VL) + lower order terms, similarly as 
in the second-order semilinear case, leading thereby to 

j t \H 2 H K [m {t) < -e\\v(t)\\ 2 HK{R) + c (\\v(t)\\ 2 L2m + \\^t,^ x )(t)\\ 2 H K m ) 

and (by GronwalFs inequality) the result. See the proof of [5, Proposition 3.4], for full details in 
the fourth-order semilinear case. 

Combining these ingredients, we obtain, modulo an appropriate increase in the integer K encod- 
ing regularity requirements, stability, as stated in Theorem 1.10, and refined stability, as stated in 
Proposition 4.5, yielding the first part (1.16) of description of asymptotic behavior in Theorem 1.12. 
By Remark 4.15, we get also a partial version of the second part (1.17)— (1.19) of Theorem 1.12, 
but describing comparisons not to the Whitham system, but only to a second-order hyperbolic- 
parabolic system agreeing with the Whitham system in its linearization about the constant state 
(M,k). This in turn yields the conclusions of (1.21), Corollary 1.18, regarding decay with respect 
to localized perturbations for linearly phase-decoupled systems. 

Finally, to recover the full result (1.1 7) -(1.1 9) of Theorem 1.12, comparing to the exact Whitham 
system, and thus the sharpened decay rate (1.22) for localized data in the quadratically decoupled 
case, we have only to observe that performing the same computations as in Appendix A (differen- 
tiating the traveling- wave ODE), and in the proof of Lemma 4.9 (pulling out quadratic order parts 
of nonlinear term J\f) while carrying along the additional higher-order terms arising in the general 
case, we obtain a higher-order analog of Lemma 4.14, expressing the resulting quadratic coupling 
constants (means) in terms of derivatives of first-order terms arising in the Whitham system, after 
which computations go as before to yield the result; see the proof of Theorem 1.12, Section 4.6. 

This completes the treatment of the quasilinear 2r-parabolic case. Reviewing the above discus- 
sion, but omitting algebraic considerations on which we focus in the next section, we find that the 
two ingredients needed to treat more general divergence-form systems are the nonlinear damping 
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estimate used to control higher-derivative by lower-derivative norms, and the high-frequency lin- 
earized resolvent bounds used to apply Priiss' Theorem. For, these were the only two places where 
we used the parabolic form of the equations; the rest of the argument was completely general, 
Moreover, the second, linearized, estimate can typically be obtained by a linearized version of the 
same energy estimate that is used to obtain the first, damping-type estimate. This allows us, in 
particular, to treat (partially parabolic) symmetric hyperbolic-parabolic equations such as arise in 
continuum mechanics, using "Kawashima-type" energy estimates as described in [35], and variants 
thereof. See, for example, Proposition 4.4 (proved in Appendix A) and Lemma B.l in [33]. 

Remark D.l. The strategy of using a common energy estimate to get, simultaneously, damping 
high-frequency resolvent, and high-frequency decay estimates, with derivative gains in the first 
compensating for derivative losses in the third, originates in the study of viscous shock stability; 
see [76, Section 4.2.1]. For simpler, and somewhat sharpened, versions in this context, see [37, 45]. 

D.2. Extensions in form: an abstract continuum of models. Still more generally, we may 
treat the full class of systems 

(D.l) u t + f(u) x = g(u) + (B 1 (u)u x ) x + (B 2 (u, u x )u xx ) x + ... , 

u,f,g€ R n , Bj € R»*», with u = / = (fy, g = , B* = (^f ^fj , u 2 G W, 

including both divergence- and nondivergence-type equations. Note that this includes both reaction 
diffusion and conservation law cases as limits / = and g = 0, but also many cases in between: for 
example, the viscous relaxation case n = 2, r = 1 occurring for the Saint- Venant equations (1.28), 
or the case n = 3, r = 1 occurring for the Benard-Marangoni model (D.4) below. 

For such models, integrating the conservative u\ equation in the traveling-wave ODE, and writing 
as an N x TV first-order system, we obtain from the requirement of periodicity N constraints, while 
we have N + n — r + 2 degrees of freedom consisting of the initial condition u(0), the wave number 
k € M, the speed c£ K, and the constant of integration q\ £ M. n ~ r arising from integration of the 
til equation; thus, we expect generically a manifold of periodic solutions of dimension n — r + 2. 
In the reaction-diffusion case r = n, this returns the familiar value 2, or, up to translation, a one- 
dimensional family (generically) indexed by wave number k. In the conservation law case r = 0, it 
returns the value n + 2, leading, up to translation, to an (n+ l)-dimensional family as in hypothesis 
(H2) of the introduction. 

Substituting this value n — r + 1 in hypotheses (H2) and (D3), therefore, we readily obtain by the 
same derivation as for (1.7) a modified Whitham system consisting of the (n — r + 1) x (n — r + 1) 
system of viscous conservation laws 

M t + k(F-cM) x = k 2 (d 11 M x + d 12 K x ) x , 

Kt + k(-uj - ck) x = k {d 2 iM x + d 2 2K x )x, 

where uj(M, k) = —kc(A4, k) denotes time frequency, M := U^ 4,K (x)dx and 

F(M,k) := f (h(U M > K (x)) - Y,{BLB{ 2 ){U M ^\x)){U M n'{x))) dx 
Jo j 

denote mean and mean "total flux" in the u\ coordinate, and dij(M, k) are determined by higher- 
order corrections. (Note that, for B 3 = constant, the terms involving B 3 are perfect derivatives, so 
disappear; this explains the fact that they were not present in the discussion of the second-order 
semilinear case.) 

Likewise, we obtain in straightforward fashion analogs of the spectral preparation results of 
Lemma 1.5, Proposition 1.7, and Proposition 3.1, thus yielding corresponding linear bounds on 
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critical modes s p . Note that the slow decay rates that may arise at the linear level from a possible 
Jordan block will still be compensated by the special structure of the nonlinear terms, coming now 
in the form 



See for example [48] for a careful derivation of the second-order derivative Whitham system up to 
linear and quadratic order in first-order derivative terms, and [33] for a proof of the needed spectral 
preparation results in the Saint- Venant case (1.28). Indeed, so long as the nonlinear structure 
of the equations permits a nonlinear damping estimate as in Proposition 2.5, and high-frequency 
linearized resolvent estimates as needed to apply Priiss' Theorem in estimating high-frequency 
linearized behavior as in the proof of (3.18) and (3.19)) above, we obtain again (modulo increase 
in the exponent of regularity K) the stability results of Theorem 1.10 and Proposition 4.5, and 
a partial version of Theorem 1.12 describing comparisons to a second-order hyperbolic-parabolic 
system agreeing with (D.2) in its linearization about the constant state (M, k)., yielding again the 
result (1.21), of Corollary 1.18 asserting decay with respect to localized perturbations for linearly 
phase-decoupled systems. 

That is, we obtain in this case exactly the conclusions cited in the examples of the introduction, 
obtained by examination of the linearization of the first-order part of the Whitham equations. 

To recover the full result (1.17)— (1.19) of Theorem 1.12 showing convergence to the exact 
Whitham system, one also needs an analog of Lemma 4.14. But, the only difference between 
the (formal) computations of the derivation in Subsection B.1.2 and the ones of Lemma 4.14 is 
that the former are carried out before the implicit change of variables, while the latter are carried 
out after. Thus, analogs of Lemma 4.14 essentially follow by commutation of an implicit change of 
variables and expansions to a desired order. 

D.3. Verification of (HI) (H3), (Dl)— (D3). Regarding verification of our stability hypotheses, 
we recall that, assuming the trivial regularity hypothesis (HI), hypothesis (H2) is implied by (Dl)- 
(D3), by Lemma 1.6, while (H3) by Proposition 1.7 can generally be verified by the same spectral 
expansion process needed to verify (D2). Meanwhile, (D1)-(D3) can be verified numerically by 
Galerkin approximation or numerical Evans function computation (see, e.g., [18, 5, 6]), or, in some 
cases, analytically, using bifurcation theory (see, e.g., [60]) or singular perturbations (see, e.g., [27]). 
For general discussion, see [3, 33, 6]. 

D.4. Applications revisited. We now discuss previous examples and some new ones in a bit 
more depth. 

D.4.1. The Korteweg-de Vries/Kuramoto-Sivashinsky equation. A more canonical form of (1-26) is 

u t + jd*u + edlu + 5d 2 x u + d x f(u) = 0, 7, S > 0, 

modeling phenomena from plasma and flame-front instabilities to inclined thin-film flow [38, 63, 
54, 48]. As a fourth-order parabolic equation, this fits the framework of Section D.l, so that all of 
the results of this paper apply. Spectral stability has been studied in detail in [12, 5], indicating 
the existence of both spectrally stable and unstable waves; in particular, (H1)-(H3) and (Dl)- 
(D3) have been shown in [5] to hold for a wide variety of waves. We note that stability under 
these hypotheses has been proven for localized perturbations in [5]; the new observations here are 
asymptotic behavior, and decay for nonlocalized perturbations. 

D.4.2. The Saint- Venant equations. Recall, in Lagrangian coordinates, the Saint- Venant equations 




T t -u x = 0, 



(D.3) 



ut + ((2F)- 1 r^ 2 ) x = 1 - tu 2 + v(t- 2 u x ) 
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where r := h , h is fluid height, u is fluid velocity, and x is a Lagrangian marker. These are not 
parabolic, yet nonlinear damping and high-frequency resolvent estimates can still be carried out, 
yielding by the discussion of Section D.2 all of the results of this paper. Specifically, nonlinear 
damping is established in [33, Proposition 4.4] (proved in Appendix A of the reference), under 
the "slope condition" vu x < -F -1 , where U = (f,u), a technical condition that appears to hold 
in most cases of interest, but which we expect can be dropped. 31 Again, the new observation 
here is asymptotic behavior, and also stability under nonlocalized perturbations, stability under 
localized data having been established in [33]. As noted earlier, (D.3) is a balance law rather than 
a conservation law, with nonconservative source term g. 

D.4.3. The capillary Saint- Venant equations. With capillary pressure effects, (D.3) becomes 

T t -u x = 0, 

ut + ((2F)~ 1 r^ 2 ) a; = 1 - ru 2 + v{t- 2 u x ) x - a(r~ 5 r xx - | t~\t x ) 2 ) x 

where a > is the coefficient of capillarity. These equations can be reduced by Kotschote's [36] 
method of auxiliary variables (introducing z := t x ) to a 3 x 3 second-order quasilinear parabolic 
system 

7f U x + Z x = T xx , 
Zt = U xx , 

u t + ((2F)- 1 r^ 2 ) :c = 1 - tu 2 + u{t~ 2 u x ) x - a(T~ 5 z x - - t~ 6 z 2 ) x 

to which standard techniques can be applied [69, 74]. This fits the framework of Section D.l, 
yielding all of the results of this paper, the only change being in the regularity assumptions on 
data, which must be incremented by one to accommodate the new variable z = r x . Existence and 
spectral stability or instability of these waves is a topic of ongoing investigation [2]. 

D.4.4. Benard-Marangoni flow. A qualitative model introduced in [20] for Benard-Marangoni flow, 
or flow driven by temperature-induced surface tension variation, is 

ut = -(1 + Uxx)xx + e 2 u + f(u,v x ,w x ), 

(DA) v t = v xx + v x + g 1 (u,v,w) x , 

w t = w xx - w x + g 2 (u,v,w) x , 

with f(u,v x ,w x ) = —u 3 + j(uv x + uw x ), gi(u,v,w) = —uv, g2{u,v,w) = —uw. Though of mixed 
fourth-order parabolic/second-order parabolic form, it is readily seen that these equations are both 
sectorial and admit a nonlinear damping estimate; moreover, they are of the mixed conserva- 
tive/nonconservative form (D.l). Thus, by the discussion of Sections D.l and D.2, the main results 
of this paper apply, giving stability and behavior in terms of a 3 x 3 hyperbolic-parabolic system 
agreeing with the Whitham system (D.2). 

Let us now discuss existence, the form of the Whitham equations, and validation of (D1)-(D3). 
Setting v = w = 0, we find that the equations reduce to the Swift-Hohenberg equation (1.23) for 
u, with bifurcation parameter r = e 2 restricted to the positive side of the bifurcation point r = 
at which periodic solutions appear. Thus, we inherit from the Swift-Hohenberg equations a special 
class of periodic solutions with (v, w) vanishing. Up to translation, such solutions are given by the 
2-parameter family of zero-speed y^j-periodic Swift-Hohenberg solutions 

(D.5) U^ £ (x) = 2e ^ 1 ~ 4uj2 COS ((l + eu)x) + 0(e 2 ), (V u ' £ , W^ £ )(x) = (0, 0), 



6 Linearized analysis suggests that the sharp condition is, rather, some averaged version of this one, which holds 

trivially by the fact that perfect derivatives have zero mean [6]. 
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where e, recall, is the bifurcation parameter, a fixed constant in (D.4). However, there are many 
other solutions for which (V,W) ^ (0,0), yielding an additional two parameters in the description 
of nearby periodic traveling waves. Moreover, though the Swift-Hohenberg solutions are zero speed 
as a result of reflection symmetry (see Remark 1.27), reflection symmetry of (D.4) is broken as soon 
as (v,w) ^ (0,0), and so in general these waves may have arbitrary speed. It is our expectation, 
therefore, that the Whitham system is not phase-decoupled even about such special waves. 

Numerical experiment by Galerkin approximation in [70] indicate that solutions (D.5) satisfy 
stability conditions (D1)-(D3) for uj = and e > in a moderate range. Here, we demonstrate 
the same conclusion for |cj| < l/2\/3 and e << 1, using decoupling of the equations and known 
analytical results for the Swift-Hohenberg equation, at the same time obtaining the limiting e — > 
coefficients of the linearized Whitham system about (M,k) = {0,k). It would be interesting to 
carry out a systematic numerical stability investigation as in [4, 5] on the entire parameter range, 
and in particular to determine phase-coupling or decoupling of the associated Whitham system. 

Proof of (Dl)-(DS). About the special solutions (D.5), the linearized eigenvalue equations are 

An = L°u + Mv + Nw, 
(D.6) Xv = L + v, 

Xw = L~w, 

where Lq is the linearized operator of the Swift-Hohenberg equation about U and 

L± :=%±d x -d x U u ' e . 

By upper triangular form of (D.6), the eigenvalues of L^, counted by algebraic multiplicity, 
consist of the union of the eigenvalues of L° and L^. Let us first consider the eigenvalues of the 
Swift-Hohenberg operator L®. In [17, 13, 14] (see also [44, 60]) it was analytically verified 32 that 
for e « 1, solutions U^' e in (D.5) are spectrally stable for 

\Au 2 \< l - + 0(e) 

(in particular, for u = 0). From the fact that the waves are of speed c = 0, we find that the 
characteristic speed of the associated scalar Whitham equation is a = 0, and the associated 
critical mode has expansion A°(£) = —d ^ 2 . Turning to the operators L ± , and noting that O u),e — > 
uniformly in all derivatives as e — > 0, we find that as e — > their eigenvalues approach uniformly 
the eigenvalues of the limiting constant-coefficient operators 

Lf := (d x + i0 2 ±(d x + i£), 

which, by direct (discrete Fourier transform) computation, are 

A ± (0 = -(i + 2 ±^' + 0, 

where the Fourier frequency j runs through the integers. By continuity, these are therefore spec- 
trally stable for |e| << 1, with approximate critical mode expansions (obtained at j = 0) of 
±i£ — £ 2 . Combining these facts, we find that the limiting linearized Whitham system has charac- 
teristic speeds a,j = 0,±1, with corresponding (diagonal) viscosity coefficients d, 1, 1. This verifies 
(D1)-(D3) and (by distinctness of a,j) (H3) for |e| sufficiently small, yielding spectral stability by 
the discussion of Section D.3 □ 



^ It has also been shown numerically that there exist bands of stable periodic Swift-Hohenberg solutions in the 
parameter space (w,/t, e) [44, 4], for |e| not necessarily small. 
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D.4.5. Inclined Marangoni flow. The related inclined thin-film equation 
(D.7) H t + (H 2 - H% = -(H 3 H XXX ) X 

models Marangoni flow driven by a thermal gradient up an inclined silicon wafer, where H denotes 
fluid height [10, 11]. As a cousin of the Kuramoto-Sivashinsky equation, it would be interesting to 
investigate whether this model too supports stable periodic traveling-waves solutions. 

D.4.6. Surfactant-driven Marangoni flow. Finally, we mention the surfactant model [43] 
(Dg) H t + 1 -{H 2 a\T)T x ) x = Q, 

d t T + d x (THa'{T)d x T) = Pe" 1 ^, 

modeling flow in a thin horizontal film driven by surfactant induced gradients in surface tension, 
where H is fluid height and V surface surfactant concentration, and Pe s is the modified Peclet 
number, a dimensionless constant, and a(F) = 1 — F is an equation of state encoding the dependence 
of surface tension on surfactant density. Like (D.7), this appears to be an interesting example for 
study by the methods developed here and in [29, 30, 33, 5]. Note, as the second equation is 
conservative, that the associated Whitham approximation is indeed of system form. 
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